Overview

Dataset statistics

Number of variables50
Number of observations101766
Missing cells374017
Missing cells (%)7.4%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory215.4 MiB
Average record size in memory2.2 KiB

Variable types

CAT36
NUM13
BOOL1

Warnings

examide has constant value "101766" Constant
citoglipton has constant value "101766" Constant
medical_specialty has a high cardinality: 72 distinct values High cardinality
diag_1 has a high cardinality: 716 distinct values High cardinality
diag_2 has a high cardinality: 748 distinct values High cardinality
diag_3 has a high cardinality: 789 distinct values High cardinality
race has 2273 (2.2%) missing values Missing
weight has 98569 (96.9%) missing values Missing
payer_code has 40256 (39.6%) missing values Missing
medical_specialty has 49949 (49.1%) missing values Missing
diag_3 has 1423 (1.4%) missing values Missing
max_glu_serum has 96420 (94.7%) missing values Missing
A1Cresult has 84748 (83.3%) missing values Missing
number_emergency is highly skewed (γ1 = 22.85558215) Skewed
encounter_id has unique values Unique
num_procedures has 46652 (45.8%) zeros Zeros
number_outpatient has 85027 (83.6%) zeros Zeros
number_emergency has 90383 (88.8%) zeros Zeros
number_inpatient has 67630 (66.5%) zeros Zeros

Reproduction

Analysis started2020-10-03 23:44:42.415603
Analysis finished2020-10-03 23:45:19.844656
Duration37.43 seconds
Software versionpandas-profiling v2.9.0
Download configurationconfig.yaml

Variables

encounter_id
Real number (ℝ≥0)

UNIQUE

Distinct101766
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean165201645.6
Minimum12522
Maximum443867222
Zeros0
Zeros (%)0.0%
Memory size795.2 KiB

Quantile statistics

Minimum12522
5-th percentile27170784
Q184961194
median152388987
Q3230270887.5
95-th percentile378962843
Maximum443867222
Range443854700
Interquartile range (IQR)145309693.5

Descriptive statistics

Standard deviation102640296
Coefficient of variation (CV)0.6213031087
Kurtosis-0.1020713932
Mean165201645.6
Median Absolute Deviation (MAD)70921143
Skewness0.6991415513
Sum1.681191067e+13
Variance1.053503036e+16
MonotocityNot monotonic
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%) 
962109421< 0.1%
 
899438461< 0.1%
 
3843069861< 0.1%
 
946501561< 0.1%
 
831567841< 0.1%
 
26744821< 0.1%
 
2813458441< 0.1%
 
1936162741< 0.1%
 
3555080241< 0.1%
 
1659738181< 0.1%
 
1252789441< 0.1%
 
4208731881< 0.1%
 
1572411541< 0.1%
 
1611610321< 0.1%
 
1748553901< 0.1%
 
1349507341< 0.1%
 
1541282101< 0.1%
 
969931081< 0.1%
 
1220641441< 0.1%
 
2977708401< 0.1%
 
3826126161< 0.1%
 
1651341721< 0.1%
 
1082448301< 0.1%
 
2105787661< 0.1%
 
4438423401< 0.1%
 
Other values (101741)101741> 99.9%
 
ValueCountFrequency (%) 
125221< 0.1%
 
157381< 0.1%
 
166801< 0.1%
 
282361< 0.1%
 
357541< 0.1%
 
369001< 0.1%
 
409261< 0.1%
 
425701< 0.1%
 
558421< 0.1%
 
622561< 0.1%
 
ValueCountFrequency (%) 
4438672221< 0.1%
 
4438571661< 0.1%
 
4438541481< 0.1%
 
4438477821< 0.1%
 
4438475481< 0.1%
 
4438471761< 0.1%
 
4438427781< 0.1%
 
4438423401< 0.1%
 
4438421361< 0.1%
 
4438420701< 0.1%
 

patient_nbr
Real number (ℝ≥0)

Distinct71518
Distinct (%)70.3%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean54330400.69
Minimum135
Maximum189502619
Zeros0
Zeros (%)0.0%
Memory size795.2 KiB

Quantile statistics

Minimum135
5-th percentile1456971.75
Q123413221
median45505143
Q387545949.75
95-th percentile111480273
Maximum189502619
Range189502484
Interquartile range (IQR)64132728.75

Descriptive statistics

Standard deviation38696359.35
Coefficient of variation (CV)0.7122413759
Kurtosis-0.3473720444
Mean54330400.69
Median Absolute Deviation (MAD)32950134
Skewness0.4712807224
Sum5.528987557e+12
Variance1.497408227e+15
MonotocityNot monotonic
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%) 
8878589140< 0.1%
 
4314090628< 0.1%
 
2319902123< 0.1%
 
166029323< 0.1%
 
8822754023< 0.1%
 
2364340522< 0.1%
 
8442861322< 0.1%
 
9270935121< 0.1%
 
2339848820< 0.1%
 
9060980420< 0.1%
 
8878970720< 0.1%
 
3709686620< 0.1%
 
8947240220< 0.1%
 
2990387720< 0.1%
 
8868195019< 0.1%
 
8847903619< 0.1%
 
9739100719< 0.1%
 
2401157718< 0.1%
 
348127218< 0.1%
 
9116028018< 0.1%
 
8434879218< 0.1%
 
340105518< 0.1%
 
9175112118< 0.1%
 
10675747817< 0.1%
 
9048919517< 0.1%
 
Other values (71493)10124599.5%
 
ValueCountFrequency (%) 
1352< 0.1%
 
3781< 0.1%
 
7291< 0.1%
 
7741< 0.1%
 
9271< 0.1%
 
11525< 0.1%
 
13051< 0.1%
 
13143< 0.1%
 
16291< 0.1%
 
20251< 0.1%
 
ValueCountFrequency (%) 
1895026191< 0.1%
 
1894814781< 0.1%
 
1894451271< 0.1%
 
1893658641< 0.1%
 
1893510951< 0.1%
 
1893494301< 0.1%
 
1893320871< 0.1%
 
1892988771< 0.1%
 
1892578462< 0.1%
 
1892157621< 0.1%
 

race
Categorical

MISSING

Distinct5
Distinct (%)< 0.1%
Missing2273
Missing (%)2.2%
Memory size795.2 KiB
Caucasian
76099 
AfricanAmerican
19210 
Hispanic
 
2037
Other
 
1506
Asian
 
641
ValueCountFrequency (%) 
Caucasian7609974.8%
 
AfricanAmerican1921018.9%
 
Hispanic20372.0%
 
Other15061.5%
 
Asian6410.6%
 
(Missing)22732.2%
 
Frequencies of value counts

Unique

Unique0 ?
Unique (%)0.0%
Histogram of lengths of the category

Length

Max length15
Median length9
Mean length9.894178802
Min length3

Overview of Unicode Properties

Unique unicode characters17
Unique unicode categories2 ?
Unique unicode scripts1 ?
Unique unicode blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Most occurring characters

ValueCountFrequency (%) 
a27166827.0%
 
n12174312.1%
 
i11923411.8%
 
c11655611.6%
 
s787777.8%
 
C760997.6%
 
u760997.6%
 
r399264.0%
 
A390613.9%
 
e207162.1%
 
f192101.9%
 
m192101.9%
 
H20370.2%
 
p20370.2%
 
O15060.1%
 
t15060.1%
 
h15060.1%
 

Most occurring categories

ValueCountFrequency (%) 
Lowercase Letter88818888.2%
 
Uppercase Letter11870311.8%
 

Most frequent Uppercase Letter characters

ValueCountFrequency (%) 
C7609964.1%
 
A3906132.9%
 
H20371.7%
 
O15061.3%
 

Most frequent Lowercase Letter characters

ValueCountFrequency (%) 
a27166830.6%
 
n12174313.7%
 
i11923413.4%
 
c11655613.1%
 
s787778.9%
 
u760998.6%
 
r399264.5%
 
e207162.3%
 
f192102.2%
 
m192102.2%
 
p20370.2%
 
t15060.2%
 
h15060.2%
 

Most occurring scripts

ValueCountFrequency (%) 
Latin1006891100.0%
 

Most frequent Latin characters

ValueCountFrequency (%) 
a27166827.0%
 
n12174312.1%
 
i11923411.8%
 
c11655611.6%
 
s787777.8%
 
C760997.6%
 
u760997.6%
 
r399264.0%
 
A390613.9%
 
e207162.1%
 
f192101.9%
 
m192101.9%
 
H20370.2%
 
p20370.2%
 
O15060.1%
 
t15060.1%
 
h15060.1%
 

Most occurring blocks

ValueCountFrequency (%) 
ASCII1006891100.0%
 

Most frequent ASCII characters

ValueCountFrequency (%) 
a27166827.0%
 
n12174312.1%
 
i11923411.8%
 
c11655611.6%
 
s787777.8%
 
C760997.6%
 
u760997.6%
 
r399264.0%
 
A390613.9%
 
e207162.1%
 
f192101.9%
 
m192101.9%
 
H20370.2%
 
p20370.2%
 
O15060.1%
 
t15060.1%
 
h15060.1%
 

gender
Categorical

Distinct3
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size795.2 KiB
Female
54708 
Male
47055 
Unknown/Invalid
 
3
ValueCountFrequency (%) 
Female5470853.8%
 
Male4705546.2%
 
Unknown/Invalid3< 0.1%
 
Frequencies of value counts

Unique

Unique0 ?
Unique (%)0.0%
Histogram of lengths of the category

Length

Max length15
Median length6
Mean length5.075496728
Min length4

Overview of Unicode Properties

Unique unicode characters16
Unique unicode categories3 ?
Unique unicode scripts2 ?
Unique unicode blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Most occurring characters

ValueCountFrequency (%) 
e15647130.3%
 
a10176619.7%
 
l10176619.7%
 
F5470810.6%
 
m5470810.6%
 
M470559.1%
 
n12< 0.1%
 
U3< 0.1%
 
k3< 0.1%
 
o3< 0.1%
 
w3< 0.1%
 
/3< 0.1%
 
I3< 0.1%
 
v3< 0.1%
 
i3< 0.1%
 
d3< 0.1%
 

Most occurring categories

ValueCountFrequency (%) 
Lowercase Letter41474180.3%
 
Uppercase Letter10176919.7%
 
Other Punctuation3< 0.1%
 

Most frequent Uppercase Letter characters

ValueCountFrequency (%) 
F5470853.8%
 
M4705546.2%
 
U3< 0.1%
 
I3< 0.1%
 

Most frequent Lowercase Letter characters

ValueCountFrequency (%) 
e15647137.7%
 
a10176624.5%
 
l10176624.5%
 
m5470813.2%
 
n12< 0.1%
 
k3< 0.1%
 
o3< 0.1%
 
w3< 0.1%
 
v3< 0.1%
 
i3< 0.1%
 
d3< 0.1%
 

Most frequent Other Punctuation characters

ValueCountFrequency (%) 
/3100.0%
 

Most occurring scripts

ValueCountFrequency (%) 
Latin516510> 99.9%
 
Common3< 0.1%
 

Most frequent Latin characters

ValueCountFrequency (%) 
e15647130.3%
 
a10176619.7%
 
l10176619.7%
 
F5470810.6%
 
m5470810.6%
 
M470559.1%
 
n12< 0.1%
 
U3< 0.1%
 
k3< 0.1%
 
o3< 0.1%
 
w3< 0.1%
 
I3< 0.1%
 
v3< 0.1%
 
i3< 0.1%
 
d3< 0.1%
 

Most frequent Common characters

ValueCountFrequency (%) 
/3100.0%
 

Most occurring blocks

ValueCountFrequency (%) 
ASCII516513100.0%
 

Most frequent ASCII characters

ValueCountFrequency (%) 
e15647130.3%
 
a10176619.7%
 
l10176619.7%
 
F5470810.6%
 
m5470810.6%
 
M470559.1%
 
n12< 0.1%
 
U3< 0.1%
 
k3< 0.1%
 
o3< 0.1%
 
w3< 0.1%
 
/3< 0.1%
 
I3< 0.1%
 
v3< 0.1%
 
i3< 0.1%
 
d3< 0.1%
 

age
Categorical

Distinct10
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size795.2 KiB
[70-80)
26068 
[60-70)
22483 
[50-60)
17256 
[80-90)
17197 
[40-50)
9685 
Other values (5)
9077 
ValueCountFrequency (%) 
[70-80)2606825.6%
 
[60-70)2248322.1%
 
[50-60)1725617.0%
 
[80-90)1719716.9%
 
[40-50)96859.5%
 
[30-40)37753.7%
 
[90-100)27932.7%
 
[20-30)16571.6%
 
[10-20)6910.7%
 
[0-10)1610.2%
 
Frequencies of value counts

Unique

Unique0 ?
Unique (%)0.0%
Histogram of lengths of the category

Length

Max length8
Median length7
Mean length7.025863255
Min length6

Overview of Unicode Properties

Unique unicode characters13
Unique unicode categories4 ?
Unique unicode scripts1 ?
Unique unicode blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Most occurring characters

ValueCountFrequency (%) 
020632528.9%
 
[10176614.2%
 
-10176614.2%
 
)10176614.2%
 
7485516.8%
 
8432656.1%
 
6397395.6%
 
5269413.8%
 
9199902.8%
 
4134601.9%
 
354320.8%
 
136450.5%
 
223480.3%
 

Most occurring categories

ValueCountFrequency (%) 
Decimal Number40969657.3%
 
Open Punctuation10176614.2%
 
Dash Punctuation10176614.2%
 
Close Punctuation10176614.2%
 

Most frequent Open Punctuation characters

ValueCountFrequency (%) 
[101766100.0%
 

Most frequent Decimal Number characters

ValueCountFrequency (%) 
020632550.4%
 
74855111.9%
 
84326510.6%
 
6397399.7%
 
5269416.6%
 
9199904.9%
 
4134603.3%
 
354321.3%
 
136450.9%
 
223480.6%
 

Most frequent Dash Punctuation characters

ValueCountFrequency (%) 
-101766100.0%
 

Most frequent Close Punctuation characters

ValueCountFrequency (%) 
)101766100.0%
 

Most occurring scripts

ValueCountFrequency (%) 
Common714994100.0%
 

Most frequent Common characters

ValueCountFrequency (%) 
020632528.9%
 
[10176614.2%
 
-10176614.2%
 
)10176614.2%
 
7485516.8%
 
8432656.1%
 
6397395.6%
 
5269413.8%
 
9199902.8%
 
4134601.9%
 
354320.8%
 
136450.5%
 
223480.3%
 

Most occurring blocks

ValueCountFrequency (%) 
ASCII714994100.0%
 

Most frequent ASCII characters

ValueCountFrequency (%) 
020632528.9%
 
[10176614.2%
 
-10176614.2%
 
)10176614.2%
 
7485516.8%
 
8432656.1%
 
6397395.6%
 
5269413.8%
 
9199902.8%
 
4134601.9%
 
354320.8%
 
136450.5%
 
223480.3%
 

weight
Categorical

MISSING

Distinct9
Distinct (%)0.3%
Missing98569
Missing (%)96.9%
Memory size795.2 KiB
[75-100)
1336 
[50-75)
897 
[100-125)
625 
[125-150)
145 
[25-50)
 
97
Other values (4)
 
97
ValueCountFrequency (%) 
[75-100)13361.3%
 
[50-75)8970.9%
 
[100-125)6250.6%
 
[125-150)1450.1%
 
[25-50)970.1%
 
[0-25)48< 0.1%
 
[150-175)35< 0.1%
 
[175-200)11< 0.1%
 
>2003< 0.1%
 
(Missing)9856996.9%
 
Frequencies of value counts

Unique

Unique0 ?
Unique (%)0.0%
Histogram of lengths of the category

Length

Max length9
Median length3
Mean length3.154265668
Min length3

Overview of Unicode Properties

Unique unicode characters11
Unique unicode categories6 ?
Unique unicode scripts2 ?
Unique unicode blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Most occurring characters

ValueCountFrequency (%) 
n19713861.4%
 
a9856930.7%
 
051721.6%
 
543681.4%
 
[31941.0%
 
-31941.0%
 
)31941.0%
 
129570.9%
 
722790.7%
 
29290.3%
 
>3< 0.1%
 

Most occurring categories

ValueCountFrequency (%) 
Lowercase Letter29570792.1%
 
Decimal Number157054.9%
 
Open Punctuation31941.0%
 
Dash Punctuation31941.0%
 
Close Punctuation31941.0%
 
Math Symbol3< 0.1%
 

Most frequent Lowercase Letter characters

ValueCountFrequency (%) 
n19713866.7%
 
a9856933.3%
 

Most frequent Open Punctuation characters

ValueCountFrequency (%) 
[3194100.0%
 

Most frequent Decimal Number characters

ValueCountFrequency (%) 
0517232.9%
 
5436827.8%
 
1295718.8%
 
7227914.5%
 
29295.9%
 

Most frequent Dash Punctuation characters

ValueCountFrequency (%) 
-3194100.0%
 

Most frequent Close Punctuation characters

ValueCountFrequency (%) 
)3194100.0%
 

Most frequent Math Symbol characters

ValueCountFrequency (%) 
>3100.0%
 

Most occurring scripts

ValueCountFrequency (%) 
Latin29570792.1%
 
Common252907.9%
 

Most frequent Latin characters

ValueCountFrequency (%) 
n19713866.7%
 
a9856933.3%
 

Most frequent Common characters

ValueCountFrequency (%) 
0517220.5%
 
5436817.3%
 
[319412.6%
 
-319412.6%
 
)319412.6%
 
1295711.7%
 
722799.0%
 
29293.7%
 
>3< 0.1%
 

Most occurring blocks

ValueCountFrequency (%) 
ASCII320997100.0%
 

Most frequent ASCII characters

ValueCountFrequency (%) 
n19713861.4%
 
a9856930.7%
 
051721.6%
 
543681.4%
 
[31941.0%
 
-31941.0%
 
)31941.0%
 
129570.9%
 
722790.7%
 
29290.3%
 
>3< 0.1%
 

admission_type_id
Real number (ℝ≥0)

Distinct8
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean2.024006053
Minimum1
Maximum8
Zeros0
Zeros (%)0.0%
Memory size795.2 KiB

Quantile statistics

Minimum1
5-th percentile1
Q11
median1
Q33
95-th percentile6
Maximum8
Range7
Interquartile range (IQR)2

Descriptive statistics

Standard deviation1.44540283
Coefficient of variation (CV)0.7141296972
Kurtosis1.942476114
Mean2.024006053
Median Absolute Deviation (MAD)0
Skewness1.591984327
Sum205975
Variance2.08918934
MonotocityNot monotonic
Histogram with fixed size bins (bins=8)
ValueCountFrequency (%) 
15399053.1%
 
31886918.5%
 
21848018.2%
 
652915.2%
 
547854.7%
 
83200.3%
 
721< 0.1%
 
410< 0.1%
 
ValueCountFrequency (%) 
15399053.1%
 
21848018.2%
 
31886918.5%
 
410< 0.1%
 
547854.7%
 
652915.2%
 
721< 0.1%
 
83200.3%
 
ValueCountFrequency (%) 
83200.3%
 
721< 0.1%
 
652915.2%
 
547854.7%
 
410< 0.1%
 
31886918.5%
 
21848018.2%
 
15399053.1%
 

discharge_disposition_id
Real number (ℝ≥0)

Distinct26
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean3.715641766
Minimum1
Maximum28
Zeros0
Zeros (%)0.0%
Memory size795.2 KiB

Quantile statistics

Minimum1
5-th percentile1
Q11
median1
Q34
95-th percentile18
Maximum28
Range27
Interquartile range (IQR)3

Descriptive statistics

Standard deviation5.280165509
Coefficient of variation (CV)1.421064204
Kurtosis6.003346764
Mean3.715641766
Median Absolute Deviation (MAD)0
Skewness2.563066993
Sum378126
Variance27.88014781
MonotocityNot monotonic
Histogram with fixed size bins (bins=26)
ValueCountFrequency (%) 
16023459.2%
 
31395413.7%
 
61290212.7%
 
1836913.6%
 
221282.1%
 
2219932.0%
 
1116421.6%
 
511841.2%
 
259891.0%
 
48150.8%
 
76230.6%
 
234120.4%
 
133990.4%
 
143720.4%
 
281390.1%
 
81080.1%
 
15630.1%
 
2448< 0.1%
 
921< 0.1%
 
1714< 0.1%
 
1611< 0.1%
 
198< 0.1%
 
106< 0.1%
 
275< 0.1%
 
123< 0.1%
 
ValueCountFrequency (%) 
16023459.2%
 
221282.1%
 
31395413.7%
 
48150.8%
 
511841.2%
 
61290212.7%
 
76230.6%
 
81080.1%
 
921< 0.1%
 
106< 0.1%
 
ValueCountFrequency (%) 
281390.1%
 
275< 0.1%
 
259891.0%
 
2448< 0.1%
 
234120.4%
 
2219932.0%
 
202< 0.1%
 
198< 0.1%
 
1836913.6%
 
1714< 0.1%
 

admission_source_id
Real number (ℝ≥0)

Distinct17
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean5.754436649
Minimum1
Maximum25
Zeros0
Zeros (%)0.0%
Memory size795.2 KiB

Quantile statistics

Minimum1
5-th percentile1
Q11
median7
Q37
95-th percentile17
Maximum25
Range24
Interquartile range (IQR)6

Descriptive statistics

Standard deviation4.064080834
Coefficient of variation (CV)0.7062517293
Kurtosis1.744989372
Mean5.754436649
Median Absolute Deviation (MAD)0
Skewness1.029934878
Sum585606
Variance16.51675303
MonotocityNot monotonic
Histogram with fixed size bins (bins=17)
ValueCountFrequency (%) 
75749456.5%
 
12956529.1%
 
1767816.7%
 
431873.1%
 
622642.2%
 
211041.1%
 
58550.8%
 
31870.2%
 
201610.2%
 
91250.1%
 
816< 0.1%
 
2212< 0.1%
 
108< 0.1%
 
112< 0.1%
 
142< 0.1%
 
252< 0.1%
 
131< 0.1%
 
ValueCountFrequency (%) 
12956529.1%
 
211041.1%
 
31870.2%
 
431873.1%
 
58550.8%
 
622642.2%
 
75749456.5%
 
816< 0.1%
 
91250.1%
 
108< 0.1%
 
ValueCountFrequency (%) 
252< 0.1%
 
2212< 0.1%
 
201610.2%
 
1767816.7%
 
142< 0.1%
 
131< 0.1%
 
112< 0.1%
 
108< 0.1%
 
91250.1%
 
816< 0.1%
 

time_in_hospital
Real number (ℝ≥0)

Distinct14
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean4.395986872
Minimum1
Maximum14
Zeros0
Zeros (%)0.0%
Memory size795.2 KiB

Quantile statistics

Minimum1
5-th percentile1
Q12
median4
Q36
95-th percentile11
Maximum14
Range13
Interquartile range (IQR)4

Descriptive statistics

Standard deviation2.985107767
Coefficient of variation (CV)0.6790529304
Kurtosis0.8502508405
Mean4.395986872
Median Absolute Deviation (MAD)2
Skewness1.133998719
Sum447362
Variance8.910868383
MonotocityNot monotonic
Histogram with fixed size bins (bins=14)
ValueCountFrequency (%) 
31775617.4%
 
21722416.9%
 
11420814.0%
 
41392413.7%
 
599669.8%
 
675397.4%
 
758595.8%
 
843914.3%
 
930022.9%
 
1023422.3%
 
1118551.8%
 
1214481.4%
 
1312101.2%
 
1410421.0%
 
ValueCountFrequency (%) 
11420814.0%
 
21722416.9%
 
31775617.4%
 
41392413.7%
 
599669.8%
 
675397.4%
 
758595.8%
 
843914.3%
 
930022.9%
 
1023422.3%
 
ValueCountFrequency (%) 
1410421.0%
 
1312101.2%
 
1214481.4%
 
1118551.8%
 
1023422.3%
 
930022.9%
 
843914.3%
 
758595.8%
 
675397.4%
 
599669.8%
 

payer_code
Categorical

MISSING

Distinct17
Distinct (%)< 0.1%
Missing40256
Missing (%)39.6%
Memory size795.2 KiB
MC
32439 
HM
6274 
SP
5007 
BC
4655 
MD
3532 
Other values (12)
9603 
ValueCountFrequency (%) 
MC3243931.9%
 
HM62746.2%
 
SP50074.9%
 
BC46554.6%
 
MD35323.5%
 
CP25332.5%
 
UN24482.4%
 
CM19371.9%
 
OG10331.0%
 
PO5920.6%
 
DM5490.5%
 
CH1460.1%
 
WC1350.1%
 
OT950.1%
 
MP790.1%
 
SI550.1%
 
FR1< 0.1%
 
(Missing)4025639.6%
 
Frequencies of value counts

Unique

Unique1 ?
Unique (%)< 0.1%
Histogram of lengths of the category

Length

Max length3
Median length2
Mean length2.39557416
Min length2

Overview of Unicode Properties

Unique unicode characters18
Unique unicode categories2 ?
Unique unicode scripts1 ?
Unique unicode blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Most occurring characters

ValueCountFrequency (%) 
n8051233.0%
 
M4481018.4%
 
C4184517.2%
 
a4025616.5%
 
P82113.4%
 
H64202.6%
 
S50622.1%
 
B46551.9%
 
D40811.7%
 
U24481.0%
 
N24481.0%
 
O17200.7%
 
G10330.4%
 
W1350.1%
 
T95< 0.1%
 
I55< 0.1%
 
F1< 0.1%
 
R1< 0.1%
 

Most occurring categories

ValueCountFrequency (%) 
Uppercase Letter12302050.5%
 
Lowercase Letter12076849.5%
 

Most frequent Lowercase Letter characters

ValueCountFrequency (%) 
n8051266.7%
 
a4025633.3%
 

Most frequent Uppercase Letter characters

ValueCountFrequency (%) 
M4481036.4%
 
C4184534.0%
 
P82116.7%
 
H64205.2%
 
S50624.1%
 
B46553.8%
 
D40813.3%
 
U24482.0%
 
N24482.0%
 
O17201.4%
 
G10330.8%
 
W1350.1%
 
T950.1%
 
I55< 0.1%
 
F1< 0.1%
 
R1< 0.1%
 

Most occurring scripts

ValueCountFrequency (%) 
Latin243788100.0%
 

Most frequent Latin characters

ValueCountFrequency (%) 
n8051233.0%
 
M4481018.4%
 
C4184517.2%
 
a4025616.5%
 
P82113.4%
 
H64202.6%
 
S50622.1%
 
B46551.9%
 
D40811.7%
 
U24481.0%
 
N24481.0%
 
O17200.7%
 
G10330.4%
 
W1350.1%
 
T95< 0.1%
 
I55< 0.1%
 
F1< 0.1%
 
R1< 0.1%
 

Most occurring blocks

ValueCountFrequency (%) 
ASCII243788100.0%
 

Most frequent ASCII characters

ValueCountFrequency (%) 
n8051233.0%
 
M4481018.4%
 
C4184517.2%
 
a4025616.5%
 
P82113.4%
 
H64202.6%
 
S50622.1%
 
B46551.9%
 
D40811.7%
 
U24481.0%
 
N24481.0%
 
O17200.7%
 
G10330.4%
 
W1350.1%
 
T95< 0.1%
 
I55< 0.1%
 
F1< 0.1%
 
R1< 0.1%
 

medical_specialty
Categorical

HIGH CARDINALITY
MISSING

Distinct72
Distinct (%)0.1%
Missing49949
Missing (%)49.1%
Memory size795.2 KiB
InternalMedicine
14635 
Emergency/Trauma
7565 
Family/GeneralPractice
7440 
Cardiology
5352 
Surgery-General
3099 
Other values (67)
13726 
ValueCountFrequency (%) 
InternalMedicine1463514.4%
 
Emergency/Trauma75657.4%
 
Family/GeneralPractice74407.3%
 
Cardiology53525.3%
 
Surgery-General30993.0%
 
Nephrology16131.6%
 
Orthopedics14001.4%
 
Orthopedics-Reconstructive12331.2%
 
Radiologist11401.1%
 
Pulmonology8710.9%
 
Psychiatry8540.8%
 
Urology6850.7%
 
ObstetricsandGynecology6710.7%
 
Surgery-Cardiovascular/Thoracic6520.6%
 
Gastroenterology5640.6%
 
Surgery-Vascular5330.5%
 
Surgery-Neuro4680.5%
 
PhysicalMedicineandRehabilitation3910.4%
 
Oncology3480.3%
 
Pediatrics2540.2%
 
Hematology/Oncology2070.2%
 
Neurology2030.2%
 
Pediatrics-Endocrinology1590.2%
 
Otolaryngology1250.1%
 
Endocrinology1200.1%
 
Other values (47)12351.2%
 
(Missing)4994949.1%
 
Frequencies of value counts

Unique

Unique9 ?
Unique (%)< 0.1%
Histogram of lengths of the category

Length

Max length36
Median length8
Mean length9.594314408
Min length3

Overview of Unicode Properties

Unique unicode characters43
Unique unicode categories4 ?
Unique unicode scripts2 ?
Unique unicode blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Most occurring characters

ValueCountFrequency (%) 
n16869617.3%
 
a12109812.4%
 
e10515110.8%
 
r768997.9%
 
i633086.5%
 
c500075.1%
 
l488715.0%
 
y349373.6%
 
t341493.5%
 
o340533.5%
 
d270352.8%
 
g255962.6%
 
m238462.4%
 
u168561.7%
 
/158711.6%
 
M150551.5%
 
I146831.5%
 
G118821.2%
 
s106381.1%
 
P104481.1%
 
T83320.9%
 
E78610.8%
 
F74510.8%
 
h69650.7%
 
-66270.7%
 
Other values (18)300603.1%
 

Most occurring categories

ValueCountFrequency (%) 
Lowercase Letter85569387.6%
 
Uppercase Letter9814810.1%
 
Other Punctuation159071.6%
 
Dash Punctuation66270.7%
 

Most frequent Uppercase Letter characters

ValueCountFrequency (%) 
M1505515.3%
 
I1468315.0%
 
G1188212.1%
 
P1044810.6%
 
T83328.5%
 
E78618.0%
 
F74517.6%
 
C63076.4%
 
S51565.3%
 
O41464.2%
 
R28472.9%
 
N23072.4%
 
U6850.7%
 
V5330.5%
 
H3510.4%
 
A550.1%
 
D49< 0.1%
 

Most frequent Lowercase Letter characters

ValueCountFrequency (%) 
n16869619.7%
 
a12109814.2%
 
e10515112.3%
 
r768999.0%
 
i633087.4%
 
c500075.8%
 
l488715.7%
 
y349374.1%
 
t341494.0%
 
o340534.0%
 
d270353.2%
 
g255963.0%
 
m238462.8%
 
u168562.0%
 
s106381.2%
 
h69650.8%
 
p44160.5%
 
v19960.2%
 
b11140.1%
 
f49< 0.1%
 
x11< 0.1%
 
w1< 0.1%
 
k1< 0.1%
 

Most frequent Dash Punctuation characters

ValueCountFrequency (%) 
-6627100.0%
 

Most frequent Other Punctuation characters

ValueCountFrequency (%) 
/1587199.8%
 
&360.2%
 

Most occurring scripts

ValueCountFrequency (%) 
Latin95384197.7%
 
Common225342.3%
 

Most frequent Latin characters

ValueCountFrequency (%) 
n16869617.7%
 
a12109812.7%
 
e10515111.0%
 
r768998.1%
 
i633086.6%
 
c500075.2%
 
l488715.1%
 
y349373.7%
 
t341493.6%
 
o340533.6%
 
d270352.8%
 
g255962.7%
 
m238462.5%
 
u168561.8%
 
M150551.6%
 
I146831.5%
 
G118821.2%
 
s106381.1%
 
P104481.1%
 
T83320.9%
 
E78610.8%
 
F74510.8%
 
h69650.7%
 
C63070.7%
 
S51560.5%
 
Other values (15)185611.9%
 

Most frequent Common characters

ValueCountFrequency (%) 
/1587170.4%
 
-662729.4%
 
&360.2%
 

Most occurring blocks

ValueCountFrequency (%) 
ASCII976375100.0%
 

Most frequent ASCII characters

ValueCountFrequency (%) 
n16869617.3%
 
a12109812.4%
 
e10515110.8%
 
r768997.9%
 
i633086.5%
 
c500075.1%
 
l488715.0%
 
y349373.6%
 
t341493.5%
 
o340533.5%
 
d270352.8%
 
g255962.6%
 
m238462.4%
 
u168561.7%
 
/158711.6%
 
M150551.5%
 
I146831.5%
 
G118821.2%
 
s106381.1%
 
P104481.1%
 
T83320.9%
 
E78610.8%
 
F74510.8%
 
h69650.7%
 
-66270.7%
 
Other values (18)300603.1%
 

num_lab_procedures
Real number (ℝ≥0)

Distinct118
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean43.09564098
Minimum1
Maximum132
Zeros0
Zeros (%)0.0%
Memory size795.2 KiB

Quantile statistics

Minimum1
5-th percentile4
Q131
median44
Q357
95-th percentile73
Maximum132
Range131
Interquartile range (IQR)26

Descriptive statistics

Standard deviation19.67436225
Coefficient of variation (CV)0.4565278947
Kurtosis-0.2450735189
Mean43.09564098
Median Absolute Deviation (MAD)13
Skewness-0.2365439206
Sum4385671
Variance387.0805299
MonotocityNot monotonic
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%) 
132083.2%
 
4328042.8%
 
4424962.5%
 
4523762.3%
 
3822132.2%
 
4022012.2%
 
4621892.2%
 
4121172.1%
 
4221132.1%
 
4721062.1%
 
3921012.1%
 
3720792.0%
 
4920662.0%
 
4820582.0%
 
3619621.9%
 
5119251.9%
 
5019241.9%
 
3519071.9%
 
5418881.9%
 
5618391.8%
 
5218381.8%
 
5518361.8%
 
5318021.8%
 
5717471.7%
 
5817081.7%
 
Other values (93)4926348.4%
 
ValueCountFrequency (%) 
132083.2%
 
211011.1%
 
36680.7%
 
43780.4%
 
52860.3%
 
62820.3%
 
73230.3%
 
83660.4%
 
99330.9%
 
108380.8%
 
ValueCountFrequency (%) 
1321< 0.1%
 
1291< 0.1%
 
1261< 0.1%
 
1211< 0.1%
 
1201< 0.1%
 
1181< 0.1%
 
1142< 0.1%
 
1133< 0.1%
 
1113< 0.1%
 
1094< 0.1%
 

num_procedures
Real number (ℝ≥0)

ZEROS

Distinct7
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean1.339730362
Minimum0
Maximum6
Zeros46652
Zeros (%)45.8%
Memory size795.2 KiB

Quantile statistics

Minimum0
5-th percentile0
Q10
median1
Q32
95-th percentile5
Maximum6
Range6
Interquartile range (IQR)2

Descriptive statistics

Standard deviation1.705806979
Coefficient of variation (CV)1.273246489
Kurtosis0.8571103021
Mean1.339730362
Median Absolute Deviation (MAD)1
Skewness1.316414763
Sum136339
Variance2.90977745
MonotocityNot monotonic
Histogram with fixed size bins (bins=7)
ValueCountFrequency (%) 
04665245.8%
 
12074220.4%
 
21271712.5%
 
394439.3%
 
649544.9%
 
441804.1%
 
530783.0%
 
ValueCountFrequency (%) 
04665245.8%
 
12074220.4%
 
21271712.5%
 
394439.3%
 
441804.1%
 
530783.0%
 
649544.9%
 
ValueCountFrequency (%) 
649544.9%
 
530783.0%
 
441804.1%
 
394439.3%
 
21271712.5%
 
12074220.4%
 
04665245.8%
 

num_medications
Real number (ℝ≥0)

Distinct75
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean16.02184423
Minimum1
Maximum81
Zeros0
Zeros (%)0.0%
Memory size795.2 KiB

Quantile statistics

Minimum1
5-th percentile6
Q110
median15
Q320
95-th percentile31
Maximum81
Range80
Interquartile range (IQR)10

Descriptive statistics

Standard deviation8.127566209
Coefficient of variation (CV)0.5072803163
Kurtosis3.468154915
Mean16.02184423
Median Absolute Deviation (MAD)5
Skewness1.326672134
Sum1630479
Variance66.05733248
MonotocityNot monotonic
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%) 
1360866.0%
 
1260045.9%
 
1157955.7%
 
1557925.7%
 
1457075.6%
 
1654305.3%
 
1053465.3%
 
1749194.8%
 
949134.8%
 
1845234.4%
 
843534.3%
 
1940784.0%
 
2036913.6%
 
734843.4%
 
2132303.2%
 
2228682.8%
 
626992.7%
 
2324262.4%
 
2421092.1%
 
520172.0%
 
2518881.9%
 
2616081.6%
 
2714321.4%
 
414171.4%
 
2812331.2%
 
Other values (50)87188.6%
 
ValueCountFrequency (%) 
12620.3%
 
24700.5%
 
39000.9%
 
414171.4%
 
520172.0%
 
626992.7%
 
734843.4%
 
843534.3%
 
949134.8%
 
1053465.3%
 
ValueCountFrequency (%) 
811< 0.1%
 
791< 0.1%
 
752< 0.1%
 
741< 0.1%
 
723< 0.1%
 
702< 0.1%
 
695< 0.1%
 
687< 0.1%
 
677< 0.1%
 
665< 0.1%
 

number_outpatient
Real number (ℝ≥0)

ZEROS

Distinct39
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean0.3693571527
Minimum0
Maximum42
Zeros85027
Zeros (%)83.6%
Memory size795.2 KiB

Quantile statistics

Minimum0
5-th percentile0
Q10
median0
Q30
95-th percentile2
Maximum42
Range42
Interquartile range (IQR)0

Descriptive statistics

Standard deviation1.267265097
Coefficient of variation (CV)3.431001911
Kurtosis147.9077363
Mean0.3693571527
Median Absolute Deviation (MAD)0
Skewness8.832958927
Sum37588
Variance1.605960825
MonotocityNot monotonic
Histogram with fixed size bins (bins=39)
ValueCountFrequency (%) 
08502783.6%
 
185478.4%
 
235943.5%
 
320422.0%
 
410991.1%
 
55330.5%
 
63030.3%
 
71550.2%
 
8980.1%
 
9830.1%
 
10570.1%
 
1142< 0.1%
 
1331< 0.1%
 
1230< 0.1%
 
1428< 0.1%
 
1520< 0.1%
 
1615< 0.1%
 
178< 0.1%
 
217< 0.1%
 
207< 0.1%
 
225< 0.1%
 
185< 0.1%
 
193< 0.1%
 
243< 0.1%
 
273< 0.1%
 
Other values (14)21< 0.1%
 
ValueCountFrequency (%) 
08502783.6%
 
185478.4%
 
235943.5%
 
320422.0%
 
410991.1%
 
55330.5%
 
63030.3%
 
71550.2%
 
8980.1%
 
9830.1%
 
ValueCountFrequency (%) 
421< 0.1%
 
401< 0.1%
 
391< 0.1%
 
381< 0.1%
 
371< 0.1%
 
362< 0.1%
 
352< 0.1%
 
341< 0.1%
 
332< 0.1%
 
292< 0.1%
 

number_emergency
Real number (ℝ≥0)

SKEWED
ZEROS

Distinct33
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean0.1978362125
Minimum0
Maximum76
Zeros90383
Zeros (%)88.8%
Memory size795.2 KiB

Quantile statistics

Minimum0
5-th percentile0
Q10
median0
Q30
95-th percentile1
Maximum76
Range76
Interquartile range (IQR)0

Descriptive statistics

Standard deviation0.9304722684
Coefficient of variation (CV)4.703245461
Kurtosis1191.686726
Mean0.1978362125
Median Absolute Deviation (MAD)0
Skewness22.85558215
Sum20133
Variance0.8657786423
MonotocityNot monotonic
Histogram with fixed size bins (bins=33)
ValueCountFrequency (%) 
09038388.8%
 
176777.5%
 
220422.0%
 
37250.7%
 
43740.4%
 
51920.2%
 
6940.1%
 
7730.1%
 
850< 0.1%
 
1034< 0.1%
 
933< 0.1%
 
1123< 0.1%
 
1312< 0.1%
 
1210< 0.1%
 
226< 0.1%
 
185< 0.1%
 
165< 0.1%
 
194< 0.1%
 
204< 0.1%
 
143< 0.1%
 
153< 0.1%
 
212< 0.1%
 
252< 0.1%
 
761< 0.1%
 
541< 0.1%
 
Other values (8)8< 0.1%
 
ValueCountFrequency (%) 
09038388.8%
 
176777.5%
 
220422.0%
 
37250.7%
 
43740.4%
 
51920.2%
 
6940.1%
 
7730.1%
 
850< 0.1%
 
933< 0.1%
 
ValueCountFrequency (%) 
761< 0.1%
 
641< 0.1%
 
631< 0.1%
 
541< 0.1%
 
461< 0.1%
 
421< 0.1%
 
371< 0.1%
 
291< 0.1%
 
281< 0.1%
 
252< 0.1%
 

number_inpatient
Real number (ℝ≥0)

ZEROS

Distinct21
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean0.6355659061
Minimum0
Maximum21
Zeros67630
Zeros (%)66.5%
Memory size795.2 KiB

Quantile statistics

Minimum0
5-th percentile0
Q10
median0
Q31
95-th percentile3
Maximum21
Range21
Interquartile range (IQR)1

Descriptive statistics

Standard deviation1.26286329
Coefficient of variation (CV)1.986990299
Kurtosis20.71939695
Mean0.6355659061
Median Absolute Deviation (MAD)0
Skewness3.614138992
Sum64679
Variance1.594823689
MonotocityNot monotonic
Histogram with fixed size bins (bins=21)
ValueCountFrequency (%) 
06763066.5%
 
11952119.2%
 
275667.4%
 
334113.4%
 
416221.6%
 
58120.8%
 
64800.5%
 
72680.3%
 
81510.1%
 
91110.1%
 
10610.1%
 
1149< 0.1%
 
1234< 0.1%
 
1320< 0.1%
 
1410< 0.1%
 
159< 0.1%
 
166< 0.1%
 
192< 0.1%
 
171< 0.1%
 
181< 0.1%
 
211< 0.1%
 
ValueCountFrequency (%) 
06763066.5%
 
11952119.2%
 
275667.4%
 
334113.4%
 
416221.6%
 
58120.8%
 
64800.5%
 
72680.3%
 
81510.1%
 
91110.1%
 
ValueCountFrequency (%) 
211< 0.1%
 
192< 0.1%
 
181< 0.1%
 
171< 0.1%
 
166< 0.1%
 
159< 0.1%
 
1410< 0.1%
 
1320< 0.1%
 
1234< 0.1%
 
1149< 0.1%
 

diag_1
Categorical

HIGH CARDINALITY

Distinct716
Distinct (%)0.7%
Missing21
Missing (%)< 0.1%
Memory size795.2 KiB
428
 
6862
414
 
6581
786
 
4016
410
 
3614
486
 
3508
Other values (711)
77164 
ValueCountFrequency (%) 
42868626.7%
 
41465816.5%
 
78640163.9%
 
41036143.6%
 
48635083.4%
 
42727662.7%
 
49122752.2%
 
71521512.1%
 
68220422.0%
 
43420282.0%
 
78020192.0%
 
99619671.9%
 
27618891.9%
 
3816881.7%
 
250.816801.7%
 
59915951.6%
 
58415201.5%
 
V5712071.2%
 
250.611831.2%
 
51811151.1%
 
82010821.1%
 
57710571.0%
 
49310561.0%
 
43510161.0%
 
5629891.0%
 
Other values (691)4483944.1%
 
Frequencies of value counts

Unique

Unique82 ?
Unique (%)0.1%
Histogram of lengths of the category

Length

Max length6
Median length3
Mean length3.175628402
Min length1

Overview of Unicode Properties

Unique unicode characters15
Unique unicode categories4 ?
Unique unicode scripts2 ?
Unique unicode blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Most occurring characters

ValueCountFrequency (%) 
45545717.2%
 
23987612.3%
 
83794911.7%
 
53713111.5%
 
7286688.9%
 
1281068.7%
 
0249607.7%
 
6231987.2%
 
9199786.2%
 
3176185.5%
 
.85222.6%
 
V16440.5%
 
n42< 0.1%
 
a21< 0.1%
 
E1< 0.1%
 

Most occurring categories

ValueCountFrequency (%) 
Decimal Number31294196.8%
 
Other Punctuation85222.6%
 
Uppercase Letter16450.5%
 
Lowercase Letter63< 0.1%
 

Most frequent Decimal Number characters

ValueCountFrequency (%) 
45545717.7%
 
23987612.7%
 
83794912.1%
 
53713111.9%
 
7286689.2%
 
1281069.0%
 
0249608.0%
 
6231987.4%
 
9199786.4%
 
3176185.6%
 

Most frequent Other Punctuation characters

ValueCountFrequency (%) 
.8522100.0%
 

Most frequent Uppercase Letter characters

ValueCountFrequency (%) 
V164499.9%
 
E10.1%
 

Most frequent Lowercase Letter characters

ValueCountFrequency (%) 
n4266.7%
 
a2133.3%
 

Most occurring scripts

ValueCountFrequency (%) 
Common32146399.5%
 
Latin17080.5%
 

Most frequent Common characters

ValueCountFrequency (%) 
45545717.3%
 
23987612.4%
 
83794911.8%
 
53713111.6%
 
7286688.9%
 
1281068.7%
 
0249607.8%
 
6231987.2%
 
9199786.2%
 
3176185.5%
 
.85222.7%
 

Most frequent Latin characters

ValueCountFrequency (%) 
V164496.3%
 
n422.5%
 
a211.2%
 
E10.1%
 

Most occurring blocks

ValueCountFrequency (%) 
ASCII323171100.0%
 

Most frequent ASCII characters

ValueCountFrequency (%) 
45545717.2%
 
23987612.3%
 
83794911.7%
 
53713111.5%
 
7286688.9%
 
1281068.7%
 
0249607.7%
 
6231987.2%
 
9199786.2%
 
3176185.5%
 
.85222.6%
 
V16440.5%
 
n42< 0.1%
 
a21< 0.1%
 
E1< 0.1%
 

diag_2
Categorical

HIGH CARDINALITY

Distinct748
Distinct (%)0.7%
Missing358
Missing (%)0.4%
Memory size795.2 KiB
276
 
6752
428
 
6662
250
 
6071
427
 
5036
401
 
3736
Other values (743)
73151 
ValueCountFrequency (%) 
27667526.6%
 
42866626.5%
 
25060716.0%
 
42750364.9%
 
40137363.7%
 
49633053.2%
 
59932883.2%
 
40328232.8%
 
41426502.6%
 
41125662.5%
 
250.0220742.0%
 
70719992.0%
 
58518711.8%
 
58416491.6%
 
49115451.5%
 
250.0115231.5%
 
28515201.5%
 
78014911.5%
 
42514341.4%
 
68214331.4%
 
48613791.4%
 
51813551.3%
 
42410711.1%
 
41310421.0%
 
250.68950.9%
 
Other values (723)3623835.6%
 
Frequencies of value counts

Unique

Unique124 ?
Unique (%)0.1%
Histogram of lengths of the category

Length

Max length6
Median length3
Mean length3.173230745
Min length1

Overview of Unicode Properties

Unique unicode characters15
Unique unicode categories4 ?
Unique unicode scripts2 ?
Unique unicode blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Most occurring characters

ValueCountFrequency (%) 
45115515.8%
 
24976515.4%
 
53817611.8%
 
03404610.5%
 
8287118.9%
 
7286548.9%
 
1261588.1%
 
9218426.8%
 
6199906.2%
 
3140974.4%
 
.67232.1%
 
V18050.6%
 
E7310.2%
 
n7160.2%
 
a3580.1%
 

Most occurring categories

ValueCountFrequency (%) 
Decimal Number31259496.8%
 
Other Punctuation67232.1%
 
Uppercase Letter25360.8%
 
Lowercase Letter10740.3%
 

Most frequent Lowercase Letter characters

ValueCountFrequency (%) 
n71666.7%
 
a35833.3%
 

Most frequent Decimal Number characters

ValueCountFrequency (%) 
45115516.4%
 
24976515.9%
 
53817612.2%
 
03404610.9%
 
8287119.2%
 
7286549.2%
 
1261588.4%
 
9218427.0%
 
6199906.4%
 
3140974.5%
 

Most frequent Other Punctuation characters

ValueCountFrequency (%) 
.6723100.0%
 

Most frequent Uppercase Letter characters

ValueCountFrequency (%) 
V180571.2%
 
E73128.8%
 

Most occurring scripts

ValueCountFrequency (%) 
Common31931798.9%
 
Latin36101.1%
 

Most frequent Latin characters

ValueCountFrequency (%) 
V180550.0%
 
E73120.2%
 
n71619.8%
 
a3589.9%
 

Most frequent Common characters

ValueCountFrequency (%) 
45115516.0%
 
24976515.6%
 
53817612.0%
 
03404610.7%
 
8287119.0%
 
7286549.0%
 
1261588.2%
 
9218426.8%
 
6199906.3%
 
3140974.4%
 
.67232.1%
 

Most occurring blocks

ValueCountFrequency (%) 
ASCII322927100.0%
 

Most frequent ASCII characters

ValueCountFrequency (%) 
45115515.8%
 
24976515.4%
 
53817611.8%
 
03404610.5%
 
8287118.9%
 
7286548.9%
 
1261588.1%
 
9218426.8%
 
6199906.2%
 
3140974.4%
 
.67232.1%
 
V18050.6%
 
E7310.2%
 
n7160.2%
 
a3580.1%
 

diag_3
Categorical

HIGH CARDINALITY
MISSING

Distinct789
Distinct (%)0.8%
Missing1423
Missing (%)1.4%
Memory size795.2 KiB
250
11555 
401
8289 
276
 
5175
428
 
4577
427
 
3955
Other values (784)
66792 
ValueCountFrequency (%) 
2501155511.4%
 
40182898.1%
 
27651755.1%
 
42845774.5%
 
42739553.9%
 
41436643.6%
 
49626052.6%
 
40323572.3%
 
58519922.0%
 
27219691.9%
 
59919411.9%
 
V4513891.4%
 
250.0213691.3%
 
70713601.3%
 
78013341.3%
 
28512001.2%
 
42511361.1%
 
250.610801.1%
 
42410631.0%
 
5849630.9%
 
3059240.9%
 
250.019150.9%
 
6828870.9%
 
5188540.8%
 
417270.7%
 
Other values (764)3706336.4%
 
(Missing)14231.4%
 
Frequencies of value counts

Unique

Unique122 ?
Unique (%)0.1%
Histogram of lengths of the category

Length

Max length6
Median length3
Mean length3.139624236
Min length1

Overview of Unicode Properties

Unique unicode characters15
Unique unicode categories4 ?
Unique unicode scripts2 ?
Unique unicode blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Most occurring characters

ValueCountFrequency (%) 
25124416.0%
 
44925215.4%
 
54126012.9%
 
03971112.4%
 
7265048.3%
 
1246847.7%
 
8238257.5%
 
9173235.4%
 
6164415.1%
 
3143334.5%
 
.56031.8%
 
V38141.2%
 
n28460.9%
 
a14230.4%
 
E12440.4%
 

Most occurring categories

ValueCountFrequency (%) 
Decimal Number30457795.3%
 
Other Punctuation56031.8%
 
Uppercase Letter50581.6%
 
Lowercase Letter42691.3%
 

Most frequent Lowercase Letter characters

ValueCountFrequency (%) 
n284666.7%
 
a142333.3%
 

Most frequent Decimal Number characters

ValueCountFrequency (%) 
25124416.8%
 
44925216.2%
 
54126013.5%
 
03971113.0%
 
7265048.7%
 
1246848.1%
 
8238257.8%
 
9173235.7%
 
6164415.4%
 
3143334.7%
 

Most frequent Uppercase Letter characters

ValueCountFrequency (%) 
V381475.4%
 
E124424.6%
 

Most frequent Other Punctuation characters

ValueCountFrequency (%) 
.5603100.0%
 

Most occurring scripts

ValueCountFrequency (%) 
Common31018097.1%
 
Latin93272.9%
 

Most frequent Latin characters

ValueCountFrequency (%) 
V381440.9%
 
n284630.5%
 
a142315.3%
 
E124413.3%
 

Most frequent Common characters

ValueCountFrequency (%) 
25124416.5%
 
44925215.9%
 
54126013.3%
 
03971112.8%
 
7265048.5%
 
1246848.0%
 
8238257.7%
 
9173235.6%
 
6164415.3%
 
3143334.6%
 
.56031.8%
 

Most occurring blocks

ValueCountFrequency (%) 
ASCII319507100.0%
 

Most frequent ASCII characters

ValueCountFrequency (%) 
25124416.0%
 
44925215.4%
 
54126012.9%
 
03971112.4%
 
7265048.3%
 
1246847.7%
 
8238257.5%
 
9173235.4%
 
6164415.1%
 
3143334.5%
 
.56031.8%
 
V38141.2%
 
n28460.9%
 
a14230.4%
 
E12440.4%
 

number_diagnoses
Real number (ℝ≥0)

Distinct16
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean7.422606765
Minimum1
Maximum16
Zeros0
Zeros (%)0.0%
Memory size795.2 KiB

Quantile statistics

Minimum1
5-th percentile4
Q16
median8
Q39
95-th percentile9
Maximum16
Range15
Interquartile range (IQR)3

Descriptive statistics

Standard deviation1.933600145
Coefficient of variation (CV)0.2605014931
Kurtosis-0.07905602427
Mean7.422606765
Median Absolute Deviation (MAD)1
Skewness-0.8767462388
Sum755369
Variance3.738809521
MonotocityNot monotonic
Histogram with fixed size bins (bins=16)
ValueCountFrequency (%) 
94947448.6%
 
51139311.2%
 
81061610.4%
 
71039310.2%
 
61016110.0%
 
455375.4%
 
328352.8%
 
210231.0%
 
12190.2%
 
1645< 0.1%
 
1017< 0.1%
 
1316< 0.1%
 
1111< 0.1%
 
1510< 0.1%
 
129< 0.1%
 
147< 0.1%
 
ValueCountFrequency (%) 
12190.2%
 
210231.0%
 
328352.8%
 
455375.4%
 
51139311.2%
 
61016110.0%
 
71039310.2%
 
81061610.4%
 
94947448.6%
 
1017< 0.1%
 
ValueCountFrequency (%) 
1645< 0.1%
 
1510< 0.1%
 
147< 0.1%
 
1316< 0.1%
 
129< 0.1%
 
1111< 0.1%
 
1017< 0.1%
 
94947448.6%
 
81061610.4%
 
71039310.2%
 

max_glu_serum
Categorical

MISSING

Distinct3
Distinct (%)0.1%
Missing96420
Missing (%)94.7%
Memory size795.2 KiB
Norm
2597 
>200
1485 
>300
1264 
ValueCountFrequency (%) 
Norm25972.6%
 
>20014851.5%
 
>30012641.2%
 
(Missing)9642094.7%
 
Frequencies of value counts

Unique

Unique0 ?
Unique (%)0.0%
Histogram of lengths of the category

Length

Max length4
Median length3
Mean length3.05253228
Min length3

Overview of Unicode Properties

Unique unicode characters10
Unique unicode categories4 ?
Unique unicode scripts2 ?
Unique unicode blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Most occurring characters

ValueCountFrequency (%) 
n19284062.1%
 
a9642031.0%
 
054981.8%
 
>27490.9%
 
N25970.8%
 
o25970.8%
 
r25970.8%
 
m25970.8%
 
214850.5%
 
312640.4%
 

Most occurring categories

ValueCountFrequency (%) 
Lowercase Letter29705195.6%
 
Decimal Number82472.7%
 
Math Symbol27490.9%
 
Uppercase Letter25970.8%
 

Most frequent Lowercase Letter characters

ValueCountFrequency (%) 
n19284064.9%
 
a9642032.5%
 
o25970.9%
 
r25970.9%
 
m25970.9%
 

Most frequent Math Symbol characters

ValueCountFrequency (%) 
>2749100.0%
 

Most frequent Decimal Number characters

ValueCountFrequency (%) 
0549866.7%
 
2148518.0%
 
3126415.3%
 

Most frequent Uppercase Letter characters

ValueCountFrequency (%) 
N2597100.0%
 

Most occurring scripts

ValueCountFrequency (%) 
Latin29964896.5%
 
Common109963.5%
 

Most frequent Latin characters

ValueCountFrequency (%) 
n19284064.4%
 
a9642032.2%
 
N25970.9%
 
o25970.9%
 
r25970.9%
 
m25970.9%
 

Most frequent Common characters

ValueCountFrequency (%) 
0549850.0%
 
>274925.0%
 
2148513.5%
 
3126411.5%
 

Most occurring blocks

ValueCountFrequency (%) 
ASCII310644100.0%
 

Most frequent ASCII characters

ValueCountFrequency (%) 
n19284062.1%
 
a9642031.0%
 
054981.8%
 
>27490.9%
 
N25970.8%
 
o25970.8%
 
r25970.8%
 
m25970.8%
 
214850.5%
 
312640.4%
 

A1Cresult
Categorical

MISSING

Distinct3
Distinct (%)< 0.1%
Missing84748
Missing (%)83.3%
Memory size795.2 KiB
>8
8216 
Norm
4990 
>7
3812 
ValueCountFrequency (%) 
>882168.1%
 
Norm49904.9%
 
>738123.7%
 
(Missing)8474883.3%
 
Frequencies of value counts

Unique

Unique0 ?
Unique (%)0.0%
Histogram of lengths of the category

Length

Max length4
Median length3
Mean length2.930841342
Min length2

Overview of Unicode Properties

Unique unicode characters9
Unique unicode categories4 ?
Unique unicode scripts2 ?
Unique unicode blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Most occurring characters

ValueCountFrequency (%) 
n16949656.8%
 
a8474828.4%
 
>120284.0%
 
882162.8%
 
N49901.7%
 
o49901.7%
 
r49901.7%
 
m49901.7%
 
738121.3%
 

Most occurring categories

ValueCountFrequency (%) 
Lowercase Letter26921490.3%
 
Math Symbol120284.0%
 
Decimal Number120284.0%
 
Uppercase Letter49901.7%
 

Most frequent Lowercase Letter characters

ValueCountFrequency (%) 
n16949663.0%
 
a8474831.5%
 
o49901.9%
 
r49901.9%
 
m49901.9%
 

Most frequent Math Symbol characters

ValueCountFrequency (%) 
>12028100.0%
 

Most frequent Decimal Number characters

ValueCountFrequency (%) 
8821668.3%
 
7381231.7%
 

Most frequent Uppercase Letter characters

ValueCountFrequency (%) 
N4990100.0%
 

Most occurring scripts

ValueCountFrequency (%) 
Latin27420491.9%
 
Common240568.1%
 

Most frequent Latin characters

ValueCountFrequency (%) 
n16949661.8%
 
a8474830.9%
 
N49901.8%
 
o49901.8%
 
r49901.8%
 
m49901.8%
 

Most frequent Common characters

ValueCountFrequency (%) 
>1202850.0%
 
8821634.2%
 
7381215.8%
 

Most occurring blocks

ValueCountFrequency (%) 
ASCII298260100.0%
 

Most frequent ASCII characters

ValueCountFrequency (%) 
n16949656.8%
 
a8474828.4%
 
>120284.0%
 
882162.8%
 
N49901.7%
 
o49901.7%
 
r49901.7%
 
m49901.7%
 
738121.3%
 

metformin
Categorical

Distinct4
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size795.2 KiB
No
81778 
Steady
18346 
Up
 
1067
Down
 
575
ValueCountFrequency (%) 
No8177880.4%
 
Steady1834618.0%
 
Up10671.0%
 
Down5750.6%
 
Frequencies of value counts

Unique

Unique0 ?
Unique (%)0.0%
Histogram of lengths of the category

Length

Max length6
Median length2
Mean length2.732405715
Min length2

Overview of Unicode Properties

Unique unicode characters13
Unique unicode categories2 ?
Unique unicode scripts1 ?
Unique unicode blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Most occurring characters

ValueCountFrequency (%) 
o8235329.6%
 
N8177829.4%
 
S183466.6%
 
t183466.6%
 
e183466.6%
 
a183466.6%
 
d183466.6%
 
y183466.6%
 
U10670.4%
 
p10670.4%
 
D5750.2%
 
w5750.2%
 
n5750.2%
 

Most occurring categories

ValueCountFrequency (%) 
Lowercase Letter17630063.4%
 
Uppercase Letter10176636.6%
 

Most frequent Uppercase Letter characters

ValueCountFrequency (%) 
N8177880.4%
 
S1834618.0%
 
U10671.0%
 
D5750.6%
 

Most frequent Lowercase Letter characters

ValueCountFrequency (%) 
o8235346.7%
 
t1834610.4%
 
e1834610.4%
 
a1834610.4%
 
d1834610.4%
 
y1834610.4%
 
p10670.6%
 
w5750.3%
 
n5750.3%
 

Most occurring scripts

ValueCountFrequency (%) 
Latin278066100.0%
 

Most frequent Latin characters

ValueCountFrequency (%) 
o8235329.6%
 
N8177829.4%
 
S183466.6%
 
t183466.6%
 
e183466.6%
 
a183466.6%
 
d183466.6%
 
y183466.6%
 
U10670.4%
 
p10670.4%
 
D5750.2%
 
w5750.2%
 
n5750.2%
 

Most occurring blocks

ValueCountFrequency (%) 
ASCII278066100.0%
 

Most frequent ASCII characters

ValueCountFrequency (%) 
o8235329.6%
 
N8177829.4%
 
S183466.6%
 
t183466.6%
 
e183466.6%
 
a183466.6%
 
d183466.6%
 
y183466.6%
 
U10670.4%
 
p10670.4%
 
D5750.2%
 
w5750.2%
 
n5750.2%
 

repaglinide
Categorical

Distinct4
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size795.2 KiB
No
100227 
Steady
 
1384
Up
 
110
Down
 
45
ValueCountFrequency (%) 
No10022798.5%
 
Steady13841.4%
 
Up1100.1%
 
Down45< 0.1%
 
Frequencies of value counts

Unique

Unique0 ?
Unique (%)0.0%
Histogram of lengths of the category

Length

Max length6
Median length2
Mean length2.05528369
Min length2

Overview of Unicode Properties

Unique unicode characters13
Unique unicode categories2 ?
Unique unicode scripts1 ?
Unique unicode blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Most occurring characters

ValueCountFrequency (%) 
o10027247.9%
 
N10022747.9%
 
S13840.7%
 
t13840.7%
 
e13840.7%
 
a13840.7%
 
d13840.7%
 
y13840.7%
 
U1100.1%
 
p1100.1%
 
D45< 0.1%
 
w45< 0.1%
 
n45< 0.1%
 

Most occurring categories

ValueCountFrequency (%) 
Lowercase Letter10739251.3%
 
Uppercase Letter10176648.7%
 

Most frequent Uppercase Letter characters

ValueCountFrequency (%) 
N10022798.5%
 
S13841.4%
 
U1100.1%
 
D45< 0.1%
 

Most frequent Lowercase Letter characters

ValueCountFrequency (%) 
o10027293.4%
 
t13841.3%
 
e13841.3%
 
a13841.3%
 
d13841.3%
 
y13841.3%
 
p1100.1%
 
w45< 0.1%
 
n45< 0.1%
 

Most occurring scripts

ValueCountFrequency (%) 
Latin209158100.0%
 

Most frequent Latin characters

ValueCountFrequency (%) 
o10027247.9%
 
N10022747.9%
 
S13840.7%
 
t13840.7%
 
e13840.7%
 
a13840.7%
 
d13840.7%
 
y13840.7%
 
U1100.1%
 
p1100.1%
 
D45< 0.1%
 
w45< 0.1%
 
n45< 0.1%
 

Most occurring blocks

ValueCountFrequency (%) 
ASCII209158100.0%
 

Most frequent ASCII characters

ValueCountFrequency (%) 
o10027247.9%
 
N10022747.9%
 
S13840.7%
 
t13840.7%
 
e13840.7%
 
a13840.7%
 
d13840.7%
 
y13840.7%
 
U1100.1%
 
p1100.1%
 
D45< 0.1%
 
w45< 0.1%
 
n45< 0.1%
 

nateglinide
Categorical

Distinct4
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size795.2 KiB
No
101063 
Steady
 
668
Up
 
24
Down
 
11
ValueCountFrequency (%) 
No10106399.3%
 
Steady6680.7%
 
Up24< 0.1%
 
Down11< 0.1%
 
Frequencies of value counts

Unique

Unique0 ?
Unique (%)0.0%
Histogram of lengths of the category

Length

Max length6
Median length2
Mean length2.026472496
Min length2

Overview of Unicode Properties

Unique unicode characters13
Unique unicode categories2 ?
Unique unicode scripts1 ?
Unique unicode blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Most occurring characters

ValueCountFrequency (%) 
o10107449.0%
 
N10106349.0%
 
S6680.3%
 
t6680.3%
 
e6680.3%
 
a6680.3%
 
d6680.3%
 
y6680.3%
 
U24< 0.1%
 
p24< 0.1%
 
D11< 0.1%
 
w11< 0.1%
 
n11< 0.1%
 

Most occurring categories

ValueCountFrequency (%) 
Lowercase Letter10446050.7%
 
Uppercase Letter10176649.3%
 

Most frequent Uppercase Letter characters

ValueCountFrequency (%) 
N10106399.3%
 
S6680.7%
 
U24< 0.1%
 
D11< 0.1%
 

Most frequent Lowercase Letter characters

ValueCountFrequency (%) 
o10107496.8%
 
t6680.6%
 
e6680.6%
 
a6680.6%
 
d6680.6%
 
y6680.6%
 
p24< 0.1%
 
w11< 0.1%
 
n11< 0.1%
 

Most occurring scripts

ValueCountFrequency (%) 
Latin206226100.0%
 

Most frequent Latin characters

ValueCountFrequency (%) 
o10107449.0%
 
N10106349.0%
 
S6680.3%
 
t6680.3%
 
e6680.3%
 
a6680.3%
 
d6680.3%
 
y6680.3%
 
U24< 0.1%
 
p24< 0.1%
 
D11< 0.1%
 
w11< 0.1%
 
n11< 0.1%
 

Most occurring blocks

ValueCountFrequency (%) 
ASCII206226100.0%
 

Most frequent ASCII characters

ValueCountFrequency (%) 
o10107449.0%
 
N10106349.0%
 
S6680.3%
 
t6680.3%
 
e6680.3%
 
a6680.3%
 
d6680.3%
 
y6680.3%
 
U24< 0.1%
 
p24< 0.1%
 
D11< 0.1%
 
w11< 0.1%
 
n11< 0.1%
 

chlorpropamide
Categorical

Distinct4
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size795.2 KiB
No
101680 
Steady
 
79
Up
 
6
Down
 
1
ValueCountFrequency (%) 
No10168099.9%
 
Steady790.1%
 
Up6< 0.1%
 
Down1< 0.1%
 
Frequencies of value counts

Unique

Unique1 ?
Unique (%)< 0.1%
Histogram of lengths of the category

Length

Max length6
Median length2
Mean length2.003124816
Min length2

Overview of Unicode Properties

Unique unicode characters13
Unique unicode categories2 ?
Unique unicode scripts1 ?
Unique unicode blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Most occurring characters

ValueCountFrequency (%) 
o10168149.9%
 
N10168049.9%
 
S79< 0.1%
 
t79< 0.1%
 
e79< 0.1%
 
a79< 0.1%
 
d79< 0.1%
 
y79< 0.1%
 
U6< 0.1%
 
p6< 0.1%
 
D1< 0.1%
 
w1< 0.1%
 
n1< 0.1%
 

Most occurring categories

ValueCountFrequency (%) 
Lowercase Letter10208450.1%
 
Uppercase Letter10176649.9%
 

Most frequent Uppercase Letter characters

ValueCountFrequency (%) 
N10168099.9%
 
S790.1%
 
U6< 0.1%
 
D1< 0.1%
 

Most frequent Lowercase Letter characters

ValueCountFrequency (%) 
o10168199.6%
 
t790.1%
 
e790.1%
 
a790.1%
 
d790.1%
 
y790.1%
 
p6< 0.1%
 
w1< 0.1%
 
n1< 0.1%
 

Most occurring scripts

ValueCountFrequency (%) 
Latin203850100.0%
 

Most frequent Latin characters

ValueCountFrequency (%) 
o10168149.9%
 
N10168049.9%
 
S79< 0.1%
 
t79< 0.1%
 
e79< 0.1%
 
a79< 0.1%
 
d79< 0.1%
 
y79< 0.1%
 
U6< 0.1%
 
p6< 0.1%
 
D1< 0.1%
 
w1< 0.1%
 
n1< 0.1%
 

Most occurring blocks

ValueCountFrequency (%) 
ASCII203850100.0%
 

Most frequent ASCII characters

ValueCountFrequency (%) 
o10168149.9%
 
N10168049.9%
 
S79< 0.1%
 
t79< 0.1%
 
e79< 0.1%
 
a79< 0.1%
 
d79< 0.1%
 
y79< 0.1%
 
U6< 0.1%
 
p6< 0.1%
 
D1< 0.1%
 
w1< 0.1%
 
n1< 0.1%
 

glimepiride
Categorical

Distinct4
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size795.2 KiB
No
96575 
Steady
 
4670
Up
 
327
Down
 
194
ValueCountFrequency (%) 
No9657594.9%
 
Steady46704.6%
 
Up3270.3%
 
Down1940.2%
 
Frequencies of value counts

Unique

Unique0 ?
Unique (%)0.0%
Histogram of lengths of the category

Length

Max length6
Median length2
Mean length2.187371028
Min length2

Overview of Unicode Properties

Unique unicode characters13
Unique unicode categories2 ?
Unique unicode scripts1 ?
Unique unicode blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Most occurring characters

ValueCountFrequency (%) 
o9676943.5%
 
N9657543.4%
 
S46702.1%
 
t46702.1%
 
e46702.1%
 
a46702.1%
 
d46702.1%
 
y46702.1%
 
U3270.1%
 
p3270.1%
 
D1940.1%
 
w1940.1%
 
n1940.1%
 

Most occurring categories

ValueCountFrequency (%) 
Lowercase Letter12083454.3%
 
Uppercase Letter10176645.7%
 

Most frequent Uppercase Letter characters

ValueCountFrequency (%) 
N9657594.9%
 
S46704.6%
 
U3270.3%
 
D1940.2%
 

Most frequent Lowercase Letter characters

ValueCountFrequency (%) 
o9676980.1%
 
t46703.9%
 
e46703.9%
 
a46703.9%
 
d46703.9%
 
y46703.9%
 
p3270.3%
 
w1940.2%
 
n1940.2%
 

Most occurring scripts

ValueCountFrequency (%) 
Latin222600100.0%
 

Most frequent Latin characters

ValueCountFrequency (%) 
o9676943.5%
 
N9657543.4%
 
S46702.1%
 
t46702.1%
 
e46702.1%
 
a46702.1%
 
d46702.1%
 
y46702.1%
 
U3270.1%
 
p3270.1%
 
D1940.1%
 
w1940.1%
 
n1940.1%
 

Most occurring blocks

ValueCountFrequency (%) 
ASCII222600100.0%
 

Most frequent ASCII characters

ValueCountFrequency (%) 
o9676943.5%
 
N9657543.4%
 
S46702.1%
 
t46702.1%
 
e46702.1%
 
a46702.1%
 
d46702.1%
 
y46702.1%
 
U3270.1%
 
p3270.1%
 
D1940.1%
 
w1940.1%
 
n1940.1%
 

acetohexamide
Categorical

Distinct2
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size795.2 KiB
No
101765 
Steady
 
1
ValueCountFrequency (%) 
No101765> 99.9%
 
Steady1< 0.1%
 
Frequencies of value counts

Unique

Unique1 ?
Unique (%)< 0.1%
Histogram of lengths of the category

Length

Max length6
Median length2
Mean length2.000039306
Min length2

Overview of Unicode Properties

Unique unicode characters8
Unique unicode categories2 ?
Unique unicode scripts1 ?
Unique unicode blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Most occurring characters

ValueCountFrequency (%) 
N10176550.0%
 
o10176550.0%
 
S1< 0.1%
 
t1< 0.1%
 
e1< 0.1%
 
a1< 0.1%
 
d1< 0.1%
 
y1< 0.1%
 

Most occurring categories

ValueCountFrequency (%) 
Lowercase Letter10177050.0%
 
Uppercase Letter10176650.0%
 

Most frequent Uppercase Letter characters

ValueCountFrequency (%) 
N101765> 99.9%
 
S1< 0.1%
 

Most frequent Lowercase Letter characters

ValueCountFrequency (%) 
o101765> 99.9%
 
t1< 0.1%
 
e1< 0.1%
 
a1< 0.1%
 
d1< 0.1%
 
y1< 0.1%
 

Most occurring scripts

ValueCountFrequency (%) 
Latin203536100.0%
 

Most frequent Latin characters

ValueCountFrequency (%) 
N10176550.0%
 
o10176550.0%
 
S1< 0.1%
 
t1< 0.1%
 
e1< 0.1%
 
a1< 0.1%
 
d1< 0.1%
 
y1< 0.1%
 

Most occurring blocks

ValueCountFrequency (%) 
ASCII203536100.0%
 

Most frequent ASCII characters

ValueCountFrequency (%) 
N10176550.0%
 
o10176550.0%
 
S1< 0.1%
 
t1< 0.1%
 
e1< 0.1%
 
a1< 0.1%
 
d1< 0.1%
 
y1< 0.1%
 

glipizide
Categorical

Distinct4
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size795.2 KiB
No
89080 
Steady
11356 
Up
 
770
Down
 
560
ValueCountFrequency (%) 
No8908087.5%
 
Steady1135611.2%
 
Up7700.8%
 
Down5600.6%
 
Frequencies of value counts

Unique

Unique0 ?
Unique (%)0.0%
Histogram of lengths of the category

Length

Max length6
Median length2
Mean length2.45736297
Min length2

Overview of Unicode Properties

Unique unicode characters13
Unique unicode categories2 ?
Unique unicode scripts1 ?
Unique unicode blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Most occurring characters

ValueCountFrequency (%) 
o8964035.8%
 
N8908035.6%
 
S113564.5%
 
t113564.5%
 
e113564.5%
 
a113564.5%
 
d113564.5%
 
y113564.5%
 
U7700.3%
 
p7700.3%
 
D5600.2%
 
w5600.2%
 
n5600.2%
 

Most occurring categories

ValueCountFrequency (%) 
Lowercase Letter14831059.3%
 
Uppercase Letter10176640.7%
 

Most frequent Uppercase Letter characters

ValueCountFrequency (%) 
N8908087.5%
 
S1135611.2%
 
U7700.8%
 
D5600.6%
 

Most frequent Lowercase Letter characters

ValueCountFrequency (%) 
o8964060.4%
 
t113567.7%
 
e113567.7%
 
a113567.7%
 
d113567.7%
 
y113567.7%
 
p7700.5%
 
w5600.4%
 
n5600.4%
 

Most occurring scripts

ValueCountFrequency (%) 
Latin250076100.0%
 

Most frequent Latin characters

ValueCountFrequency (%) 
o8964035.8%
 
N8908035.6%
 
S113564.5%
 
t113564.5%
 
e113564.5%
 
a113564.5%
 
d113564.5%
 
y113564.5%
 
U7700.3%
 
p7700.3%
 
D5600.2%
 
w5600.2%
 
n5600.2%
 

Most occurring blocks

ValueCountFrequency (%) 
ASCII250076100.0%
 

Most frequent ASCII characters

ValueCountFrequency (%) 
o8964035.8%
 
N8908035.6%
 
S113564.5%
 
t113564.5%
 
e113564.5%
 
a113564.5%
 
d113564.5%
 
y113564.5%
 
U7700.3%
 
p7700.3%
 
D5600.2%
 
w5600.2%
 
n5600.2%
 

glyburide
Categorical

Distinct4
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size795.2 KiB
No
91116 
Steady
9274 
Up
 
812
Down
 
564
ValueCountFrequency (%) 
No9111689.5%
 
Steady92749.1%
 
Up8120.8%
 
Down5640.6%
 
Frequencies of value counts

Unique

Unique0 ?
Unique (%)0.0%
Histogram of lengths of the category

Length

Max length6
Median length2
Mean length2.375606784
Min length2

Overview of Unicode Properties

Unique unicode characters13
Unique unicode categories2 ?
Unique unicode scripts1 ?
Unique unicode blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Most occurring characters

ValueCountFrequency (%) 
o9168037.9%
 
N9111637.7%
 
S92743.8%
 
t92743.8%
 
e92743.8%
 
a92743.8%
 
d92743.8%
 
y92743.8%
 
U8120.3%
 
p8120.3%
 
D5640.2%
 
w5640.2%
 
n5640.2%
 

Most occurring categories

ValueCountFrequency (%) 
Lowercase Letter13999057.9%
 
Uppercase Letter10176642.1%
 

Most frequent Uppercase Letter characters

ValueCountFrequency (%) 
N9111689.5%
 
S92749.1%
 
U8120.8%
 
D5640.6%
 

Most frequent Lowercase Letter characters

ValueCountFrequency (%) 
o9168065.5%
 
t92746.6%
 
e92746.6%
 
a92746.6%
 
d92746.6%
 
y92746.6%
 
p8120.6%
 
w5640.4%
 
n5640.4%
 

Most occurring scripts

ValueCountFrequency (%) 
Latin241756100.0%
 

Most frequent Latin characters

ValueCountFrequency (%) 
o9168037.9%
 
N9111637.7%
 
S92743.8%
 
t92743.8%
 
e92743.8%
 
a92743.8%
 
d92743.8%
 
y92743.8%
 
U8120.3%
 
p8120.3%
 
D5640.2%
 
w5640.2%
 
n5640.2%
 

Most occurring blocks

ValueCountFrequency (%) 
ASCII241756100.0%
 

Most frequent ASCII characters

ValueCountFrequency (%) 
o9168037.9%
 
N9111637.7%
 
S92743.8%
 
t92743.8%
 
e92743.8%
 
a92743.8%
 
d92743.8%
 
y92743.8%
 
U8120.3%
 
p8120.3%
 
D5640.2%
 
w5640.2%
 
n5640.2%
 

tolbutamide
Categorical

Distinct2
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size795.2 KiB
No
101743 
Steady
 
23
ValueCountFrequency (%) 
No101743> 99.9%
 
Steady23< 0.1%
 
Frequencies of value counts

Unique

Unique0 ?
Unique (%)0.0%
Histogram of lengths of the category

Length

Max length6
Median length2
Mean length2.000904035
Min length2

Overview of Unicode Properties

Unique unicode characters8
Unique unicode categories2 ?
Unique unicode scripts1 ?
Unique unicode blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Most occurring characters

ValueCountFrequency (%) 
N10174350.0%
 
o10174350.0%
 
S23< 0.1%
 
t23< 0.1%
 
e23< 0.1%
 
a23< 0.1%
 
d23< 0.1%
 
y23< 0.1%
 

Most occurring categories

ValueCountFrequency (%) 
Lowercase Letter10185850.0%
 
Uppercase Letter10176650.0%
 

Most frequent Uppercase Letter characters

ValueCountFrequency (%) 
N101743> 99.9%
 
S23< 0.1%
 

Most frequent Lowercase Letter characters

ValueCountFrequency (%) 
o10174399.9%
 
t23< 0.1%
 
e23< 0.1%
 
a23< 0.1%
 
d23< 0.1%
 
y23< 0.1%
 

Most occurring scripts

ValueCountFrequency (%) 
Latin203624100.0%
 

Most frequent Latin characters

ValueCountFrequency (%) 
N10174350.0%
 
o10174350.0%
 
S23< 0.1%
 
t23< 0.1%
 
e23< 0.1%
 
a23< 0.1%
 
d23< 0.1%
 
y23< 0.1%
 

Most occurring blocks

ValueCountFrequency (%) 
ASCII203624100.0%
 

Most frequent ASCII characters

ValueCountFrequency (%) 
N10174350.0%
 
o10174350.0%
 
S23< 0.1%
 
t23< 0.1%
 
e23< 0.1%
 
a23< 0.1%
 
d23< 0.1%
 
y23< 0.1%
 

pioglitazone
Categorical

Distinct4
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size795.2 KiB
No
94438 
Steady
 
6976
Up
 
234
Down
 
118
ValueCountFrequency (%) 
No9443892.8%
 
Steady69766.9%
 
Up2340.2%
 
Down1180.1%
 
Frequencies of value counts

Unique

Unique0 ?
Unique (%)0.0%
Histogram of lengths of the category

Length

Max length6
Median length2
Mean length2.276516715
Min length2

Overview of Unicode Properties

Unique unicode characters13
Unique unicode categories2 ?
Unique unicode scripts1 ?
Unique unicode blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Most occurring characters

ValueCountFrequency (%) 
o9455640.8%
 
N9443840.8%
 
S69763.0%
 
t69763.0%
 
e69763.0%
 
a69763.0%
 
d69763.0%
 
y69763.0%
 
U2340.1%
 
p2340.1%
 
D1180.1%
 
w1180.1%
 
n1180.1%
 

Most occurring categories

ValueCountFrequency (%) 
Lowercase Letter12990656.1%
 
Uppercase Letter10176643.9%
 

Most frequent Uppercase Letter characters

ValueCountFrequency (%) 
N9443892.8%
 
S69766.9%
 
U2340.2%
 
D1180.1%
 

Most frequent Lowercase Letter characters

ValueCountFrequency (%) 
o9455672.8%
 
t69765.4%
 
e69765.4%
 
a69765.4%
 
d69765.4%
 
y69765.4%
 
p2340.2%
 
w1180.1%
 
n1180.1%
 

Most occurring scripts

ValueCountFrequency (%) 
Latin231672100.0%
 

Most frequent Latin characters

ValueCountFrequency (%) 
o9455640.8%
 
N9443840.8%
 
S69763.0%
 
t69763.0%
 
e69763.0%
 
a69763.0%
 
d69763.0%
 
y69763.0%
 
U2340.1%
 
p2340.1%
 
D1180.1%
 
w1180.1%
 
n1180.1%
 

Most occurring blocks

ValueCountFrequency (%) 
ASCII231672100.0%
 

Most frequent ASCII characters

ValueCountFrequency (%) 
o9455640.8%
 
N9443840.8%
 
S69763.0%
 
t69763.0%
 
e69763.0%
 
a69763.0%
 
d69763.0%
 
y69763.0%
 
U2340.1%
 
p2340.1%
 
D1180.1%
 
w1180.1%
 
n1180.1%
 

rosiglitazone
Categorical

Distinct4
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size795.2 KiB
No
95401 
Steady
 
6100
Up
 
178
Down
 
87
ValueCountFrequency (%) 
No9540193.7%
 
Steady61006.0%
 
Up1780.2%
 
Down870.1%
 
Frequencies of value counts

Unique

Unique0 ?
Unique (%)0.0%
Histogram of lengths of the category

Length

Max length6
Median length2
Mean length2.241475542
Min length2

Overview of Unicode Properties

Unique unicode characters13
Unique unicode categories2 ?
Unique unicode scripts1 ?
Unique unicode blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Most occurring characters

ValueCountFrequency (%) 
o9548841.9%
 
N9540141.8%
 
S61002.7%
 
t61002.7%
 
e61002.7%
 
a61002.7%
 
d61002.7%
 
y61002.7%
 
U1780.1%
 
p1780.1%
 
D87< 0.1%
 
w87< 0.1%
 
n87< 0.1%
 

Most occurring categories

ValueCountFrequency (%) 
Lowercase Letter12634055.4%
 
Uppercase Letter10176644.6%
 

Most frequent Uppercase Letter characters

ValueCountFrequency (%) 
N9540193.7%
 
S61006.0%
 
U1780.2%
 
D870.1%
 

Most frequent Lowercase Letter characters

ValueCountFrequency (%) 
o9548875.6%
 
t61004.8%
 
e61004.8%
 
a61004.8%
 
d61004.8%
 
y61004.8%
 
p1780.1%
 
w870.1%
 
n870.1%
 

Most occurring scripts

ValueCountFrequency (%) 
Latin228106100.0%
 

Most frequent Latin characters

ValueCountFrequency (%) 
o9548841.9%
 
N9540141.8%
 
S61002.7%
 
t61002.7%
 
e61002.7%
 
a61002.7%
 
d61002.7%
 
y61002.7%
 
U1780.1%
 
p1780.1%
 
D87< 0.1%
 
w87< 0.1%
 
n87< 0.1%
 

Most occurring blocks

ValueCountFrequency (%) 
ASCII228106100.0%
 

Most frequent ASCII characters

ValueCountFrequency (%) 
o9548841.9%
 
N9540141.8%
 
S61002.7%
 
t61002.7%
 
e61002.7%
 
a61002.7%
 
d61002.7%
 
y61002.7%
 
U1780.1%
 
p1780.1%
 
D87< 0.1%
 
w87< 0.1%
 
n87< 0.1%
 

acarbose
Categorical

Distinct4
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size795.2 KiB
No
101458 
Steady
 
295
Up
 
10
Down
 
3
ValueCountFrequency (%) 
No10145899.7%
 
Steady2950.3%
 
Up10< 0.1%
 
Down3< 0.1%
 
Frequencies of value counts

Unique

Unique0 ?
Unique (%)0.0%
Histogram of lengths of the category

Length

Max length6
Median length2
Mean length2.011654187
Min length2

Overview of Unicode Properties

Unique unicode characters13
Unique unicode categories2 ?
Unique unicode scripts1 ?
Unique unicode blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Most occurring characters

ValueCountFrequency (%) 
o10146149.6%
 
N10145849.6%
 
S2950.1%
 
t2950.1%
 
e2950.1%
 
a2950.1%
 
d2950.1%
 
y2950.1%
 
U10< 0.1%
 
p10< 0.1%
 
D3< 0.1%
 
w3< 0.1%
 
n3< 0.1%
 

Most occurring categories

ValueCountFrequency (%) 
Lowercase Letter10295250.3%
 
Uppercase Letter10176649.7%
 

Most frequent Uppercase Letter characters

ValueCountFrequency (%) 
N10145899.7%
 
S2950.3%
 
U10< 0.1%
 
D3< 0.1%
 

Most frequent Lowercase Letter characters

ValueCountFrequency (%) 
o10146198.6%
 
t2950.3%
 
e2950.3%
 
a2950.3%
 
d2950.3%
 
y2950.3%
 
p10< 0.1%
 
w3< 0.1%
 
n3< 0.1%
 

Most occurring scripts

ValueCountFrequency (%) 
Latin204718100.0%
 

Most frequent Latin characters

ValueCountFrequency (%) 
o10146149.6%
 
N10145849.6%
 
S2950.1%
 
t2950.1%
 
e2950.1%
 
a2950.1%
 
d2950.1%
 
y2950.1%
 
U10< 0.1%
 
p10< 0.1%
 
D3< 0.1%
 
w3< 0.1%
 
n3< 0.1%
 

Most occurring blocks

ValueCountFrequency (%) 
ASCII204718100.0%
 

Most frequent ASCII characters

ValueCountFrequency (%) 
o10146149.6%
 
N10145849.6%
 
S2950.1%
 
t2950.1%
 
e2950.1%
 
a2950.1%
 
d2950.1%
 
y2950.1%
 
U10< 0.1%
 
p10< 0.1%
 
D3< 0.1%
 
w3< 0.1%
 
n3< 0.1%
 

miglitol
Categorical

Distinct4
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size795.2 KiB
No
101728 
Steady
 
31
Down
 
5
Up
 
2
ValueCountFrequency (%) 
No101728> 99.9%
 
Steady31< 0.1%
 
Down5< 0.1%
 
Up2< 0.1%
 
Frequencies of value counts

Unique

Unique0 ?
Unique (%)0.0%
Histogram of lengths of the category

Length

Max length6
Median length2
Mean length2.001316746
Min length2

Overview of Unicode Properties

Unique unicode characters13
Unique unicode categories2 ?
Unique unicode scripts1 ?
Unique unicode blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Most occurring characters

ValueCountFrequency (%) 
o10173350.0%
 
N10172849.9%
 
S31< 0.1%
 
t31< 0.1%
 
e31< 0.1%
 
a31< 0.1%
 
d31< 0.1%
 
y31< 0.1%
 
D5< 0.1%
 
w5< 0.1%
 
n5< 0.1%
 
U2< 0.1%
 
p2< 0.1%
 

Most occurring categories

ValueCountFrequency (%) 
Lowercase Letter10190050.0%
 
Uppercase Letter10176650.0%
 

Most frequent Uppercase Letter characters

ValueCountFrequency (%) 
N101728> 99.9%
 
S31< 0.1%
 
D5< 0.1%
 
U2< 0.1%
 

Most frequent Lowercase Letter characters

ValueCountFrequency (%) 
o10173399.8%
 
t31< 0.1%
 
e31< 0.1%
 
a31< 0.1%
 
d31< 0.1%
 
y31< 0.1%
 
w5< 0.1%
 
n5< 0.1%
 
p2< 0.1%
 

Most occurring scripts

ValueCountFrequency (%) 
Latin203666100.0%
 

Most frequent Latin characters

ValueCountFrequency (%) 
o10173350.0%
 
N10172849.9%
 
S31< 0.1%
 
t31< 0.1%
 
e31< 0.1%
 
a31< 0.1%
 
d31< 0.1%
 
y31< 0.1%
 
D5< 0.1%
 
w5< 0.1%
 
n5< 0.1%
 
U2< 0.1%
 
p2< 0.1%
 

Most occurring blocks

ValueCountFrequency (%) 
ASCII203666100.0%
 

Most frequent ASCII characters

ValueCountFrequency (%) 
o10173350.0%
 
N10172849.9%
 
S31< 0.1%
 
t31< 0.1%
 
e31< 0.1%
 
a31< 0.1%
 
d31< 0.1%
 
y31< 0.1%
 
D5< 0.1%
 
w5< 0.1%
 
n5< 0.1%
 
U2< 0.1%
 
p2< 0.1%
 

troglitazone
Categorical

Distinct2
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size795.2 KiB
No
101763 
Steady
 
3
ValueCountFrequency (%) 
No101763> 99.9%
 
Steady3< 0.1%
 
Frequencies of value counts

Unique

Unique0 ?
Unique (%)0.0%
Histogram of lengths of the category

Length

Max length6
Median length2
Mean length2.000117918
Min length2

Overview of Unicode Properties

Unique unicode characters8
Unique unicode categories2 ?
Unique unicode scripts1 ?
Unique unicode blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Most occurring characters

ValueCountFrequency (%) 
N10176350.0%
 
o10176350.0%
 
S3< 0.1%
 
t3< 0.1%
 
e3< 0.1%
 
a3< 0.1%
 
d3< 0.1%
 
y3< 0.1%
 

Most occurring categories

ValueCountFrequency (%) 
Lowercase Letter10177850.0%
 
Uppercase Letter10176650.0%
 

Most frequent Uppercase Letter characters

ValueCountFrequency (%) 
N101763> 99.9%
 
S3< 0.1%
 

Most frequent Lowercase Letter characters

ValueCountFrequency (%) 
o101763> 99.9%
 
t3< 0.1%
 
e3< 0.1%
 
a3< 0.1%
 
d3< 0.1%
 
y3< 0.1%
 

Most occurring scripts

ValueCountFrequency (%) 
Latin203544100.0%
 

Most frequent Latin characters

ValueCountFrequency (%) 
N10176350.0%
 
o10176350.0%
 
S3< 0.1%
 
t3< 0.1%
 
e3< 0.1%
 
a3< 0.1%
 
d3< 0.1%
 
y3< 0.1%
 

Most occurring blocks

ValueCountFrequency (%) 
ASCII203544100.0%
 

Most frequent ASCII characters

ValueCountFrequency (%) 
N10176350.0%
 
o10176350.0%
 
S3< 0.1%
 
t3< 0.1%
 
e3< 0.1%
 
a3< 0.1%
 
d3< 0.1%
 
y3< 0.1%
 

tolazamide
Categorical

Distinct3
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size795.2 KiB
No
101727 
Steady
 
38
Up
 
1
ValueCountFrequency (%) 
No101727> 99.9%
 
Steady38< 0.1%
 
Up1< 0.1%
 
Frequencies of value counts

Unique

Unique1 ?
Unique (%)< 0.1%
Histogram of lengths of the category

Length

Max length6
Median length2
Mean length2.001493623
Min length2

Overview of Unicode Properties

Unique unicode characters10
Unique unicode categories2 ?
Unique unicode scripts1 ?
Unique unicode blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Most occurring characters

ValueCountFrequency (%) 
N10172749.9%
 
o10172749.9%
 
S38< 0.1%
 
t38< 0.1%
 
e38< 0.1%
 
a38< 0.1%
 
d38< 0.1%
 
y38< 0.1%
 
U1< 0.1%
 
p1< 0.1%
 

Most occurring categories

ValueCountFrequency (%) 
Lowercase Letter10191850.0%
 
Uppercase Letter10176650.0%
 

Most frequent Uppercase Letter characters

ValueCountFrequency (%) 
N101727> 99.9%
 
S38< 0.1%
 
U1< 0.1%
 

Most frequent Lowercase Letter characters

ValueCountFrequency (%) 
o10172799.8%
 
t38< 0.1%
 
e38< 0.1%
 
a38< 0.1%
 
d38< 0.1%
 
y38< 0.1%
 
p1< 0.1%
 

Most occurring scripts

ValueCountFrequency (%) 
Latin203684100.0%
 

Most frequent Latin characters

ValueCountFrequency (%) 
N10172749.9%
 
o10172749.9%
 
S38< 0.1%
 
t38< 0.1%
 
e38< 0.1%
 
a38< 0.1%
 
d38< 0.1%
 
y38< 0.1%
 
U1< 0.1%
 
p1< 0.1%
 

Most occurring blocks

ValueCountFrequency (%) 
ASCII203684100.0%
 

Most frequent ASCII characters

ValueCountFrequency (%) 
N10172749.9%
 
o10172749.9%
 
S38< 0.1%
 
t38< 0.1%
 
e38< 0.1%
 
a38< 0.1%
 
d38< 0.1%
 
y38< 0.1%
 
U1< 0.1%
 
p1< 0.1%
 

examide
Categorical

CONSTANT
REJECTED

Distinct1
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size795.2 KiB
No
101766 
ValueCountFrequency (%) 
No101766100.0%
 
Frequencies of value counts

Unique

Unique0 ?
Unique (%)0.0%
Histogram of lengths of the category

Length

Max length2
Median length2
Mean length2
Min length2

Overview of Unicode Properties

Unique unicode characters2
Unique unicode categories2 ?
Unique unicode scripts1 ?
Unique unicode blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Most occurring characters

ValueCountFrequency (%) 
N10176650.0%
 
o10176650.0%
 

Most occurring categories

ValueCountFrequency (%) 
Uppercase Letter10176650.0%
 
Lowercase Letter10176650.0%
 

Most frequent Uppercase Letter characters

ValueCountFrequency (%) 
N101766100.0%
 

Most frequent Lowercase Letter characters

ValueCountFrequency (%) 
o101766100.0%
 

Most occurring scripts

ValueCountFrequency (%) 
Latin203532100.0%
 

Most frequent Latin characters

ValueCountFrequency (%) 
N10176650.0%
 
o10176650.0%
 

Most occurring blocks

ValueCountFrequency (%) 
ASCII203532100.0%
 

Most frequent ASCII characters

ValueCountFrequency (%) 
N10176650.0%
 
o10176650.0%
 

citoglipton
Categorical

CONSTANT
REJECTED

Distinct1
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size795.2 KiB
No
101766 
ValueCountFrequency (%) 
No101766100.0%
 
Frequencies of value counts

Unique

Unique0 ?
Unique (%)0.0%
Histogram of lengths of the category

Length

Max length2
Median length2
Mean length2
Min length2

Overview of Unicode Properties

Unique unicode characters2
Unique unicode categories2 ?
Unique unicode scripts1 ?
Unique unicode blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Most occurring characters

ValueCountFrequency (%) 
N10176650.0%
 
o10176650.0%
 

Most occurring categories

ValueCountFrequency (%) 
Uppercase Letter10176650.0%
 
Lowercase Letter10176650.0%
 

Most frequent Uppercase Letter characters

ValueCountFrequency (%) 
N101766100.0%
 

Most frequent Lowercase Letter characters

ValueCountFrequency (%) 
o101766100.0%
 

Most occurring scripts

ValueCountFrequency (%) 
Latin203532100.0%
 

Most frequent Latin characters

ValueCountFrequency (%) 
N10176650.0%
 
o10176650.0%
 

Most occurring blocks

ValueCountFrequency (%) 
ASCII203532100.0%
 

Most frequent ASCII characters

ValueCountFrequency (%) 
N10176650.0%
 
o10176650.0%
 

insulin
Categorical

Distinct4
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size795.2 KiB
No
47383 
Steady
30849 
Down
12218 
Up
11316 
ValueCountFrequency (%) 
No4738346.6%
 
Steady3084930.3%
 
Down1221812.0%
 
Up1131611.1%
 
Frequencies of value counts

Unique

Unique0 ?
Unique (%)0.0%
Histogram of lengths of the category

Length

Max length6
Median length2
Mean length3.45266592
Min length2

Overview of Unicode Properties

Unique unicode characters13
Unique unicode categories2 ?
Unique unicode scripts1 ?
Unique unicode blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Most occurring characters

ValueCountFrequency (%) 
o5960117.0%
 
N4738313.5%
 
S308498.8%
 
t308498.8%
 
e308498.8%
 
a308498.8%
 
d308498.8%
 
y308498.8%
 
D122183.5%
 
w122183.5%
 
n122183.5%
 
U113163.2%
 
p113163.2%
 

Most occurring categories

ValueCountFrequency (%) 
Lowercase Letter24959871.0%
 
Uppercase Letter10176629.0%
 

Most frequent Uppercase Letter characters

ValueCountFrequency (%) 
N4738346.6%
 
S3084930.3%
 
D1221812.0%
 
U1131611.1%
 

Most frequent Lowercase Letter characters

ValueCountFrequency (%) 
o5960123.9%
 
t3084912.4%
 
e3084912.4%
 
a3084912.4%
 
d3084912.4%
 
y3084912.4%
 
w122184.9%
 
n122184.9%
 
p113164.5%
 

Most occurring scripts

ValueCountFrequency (%) 
Latin351364100.0%
 

Most frequent Latin characters

ValueCountFrequency (%) 
o5960117.0%
 
N4738313.5%
 
S308498.8%
 
t308498.8%
 
e308498.8%
 
a308498.8%
 
d308498.8%
 
y308498.8%
 
D122183.5%
 
w122183.5%
 
n122183.5%
 
U113163.2%
 
p113163.2%
 

Most occurring blocks

ValueCountFrequency (%) 
ASCII351364100.0%
 

Most frequent ASCII characters

ValueCountFrequency (%) 
o5960117.0%
 
N4738313.5%
 
S308498.8%
 
t308498.8%
 
e308498.8%
 
a308498.8%
 
d308498.8%
 
y308498.8%
 
D122183.5%
 
w122183.5%
 
n122183.5%
 
U113163.2%
 
p113163.2%
 
Distinct4
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size795.2 KiB
No
101060 
Steady
 
692
Up
 
8
Down
 
6
ValueCountFrequency (%) 
No10106099.3%
 
Steady6920.7%
 
Up8< 0.1%
 
Down6< 0.1%
 
Frequencies of value counts

Unique

Unique0 ?
Unique (%)0.0%
Histogram of lengths of the category

Length

Max length6
Median length2
Mean length2.027317572
Min length2

Overview of Unicode Properties

Unique unicode characters13
Unique unicode categories2 ?
Unique unicode scripts1 ?
Unique unicode blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Most occurring characters

ValueCountFrequency (%) 
o10106649.0%
 
N10106049.0%
 
S6920.3%
 
t6920.3%
 
e6920.3%
 
a6920.3%
 
d6920.3%
 
y6920.3%
 
U8< 0.1%
 
p8< 0.1%
 
D6< 0.1%
 
w6< 0.1%
 
n6< 0.1%
 

Most occurring categories

ValueCountFrequency (%) 
Lowercase Letter10454650.7%
 
Uppercase Letter10176649.3%
 

Most frequent Uppercase Letter characters

ValueCountFrequency (%) 
N10106099.3%
 
S6920.7%
 
U8< 0.1%
 
D6< 0.1%
 

Most frequent Lowercase Letter characters

ValueCountFrequency (%) 
o10106696.7%
 
t6920.7%
 
e6920.7%
 
a6920.7%
 
d6920.7%
 
y6920.7%
 
p8< 0.1%
 
w6< 0.1%
 
n6< 0.1%
 

Most occurring scripts

ValueCountFrequency (%) 
Latin206312100.0%
 

Most frequent Latin characters

ValueCountFrequency (%) 
o10106649.0%
 
N10106049.0%
 
S6920.3%
 
t6920.3%
 
e6920.3%
 
a6920.3%
 
d6920.3%
 
y6920.3%
 
U8< 0.1%
 
p8< 0.1%
 
D6< 0.1%
 
w6< 0.1%
 
n6< 0.1%
 

Most occurring blocks

ValueCountFrequency (%) 
ASCII206312100.0%
 

Most frequent ASCII characters

ValueCountFrequency (%) 
o10106649.0%
 
N10106049.0%
 
S6920.3%
 
t6920.3%
 
e6920.3%
 
a6920.3%
 
d6920.3%
 
y6920.3%
 
U8< 0.1%
 
p8< 0.1%
 
D6< 0.1%
 
w6< 0.1%
 
n6< 0.1%
 
Distinct2
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size795.2 KiB
No
101753 
Steady
 
13
ValueCountFrequency (%) 
No101753> 99.9%
 
Steady13< 0.1%
 
Frequencies of value counts

Unique

Unique0 ?
Unique (%)0.0%
Histogram of lengths of the category

Length

Max length6
Median length2
Mean length2.000510976
Min length2

Overview of Unicode Properties

Unique unicode characters8
Unique unicode categories2 ?
Unique unicode scripts1 ?
Unique unicode blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Most occurring characters

ValueCountFrequency (%) 
N10175350.0%
 
o10175350.0%
 
S13< 0.1%
 
t13< 0.1%
 
e13< 0.1%
 
a13< 0.1%
 
d13< 0.1%
 
y13< 0.1%
 

Most occurring categories

ValueCountFrequency (%) 
Lowercase Letter10181850.0%
 
Uppercase Letter10176650.0%
 

Most frequent Uppercase Letter characters

ValueCountFrequency (%) 
N101753> 99.9%
 
S13< 0.1%
 

Most frequent Lowercase Letter characters

ValueCountFrequency (%) 
o10175399.9%
 
t13< 0.1%
 
e13< 0.1%
 
a13< 0.1%
 
d13< 0.1%
 
y13< 0.1%
 

Most occurring scripts

ValueCountFrequency (%) 
Latin203584100.0%
 

Most frequent Latin characters

ValueCountFrequency (%) 
N10175350.0%
 
o10175350.0%
 
S13< 0.1%
 
t13< 0.1%
 
e13< 0.1%
 
a13< 0.1%
 
d13< 0.1%
 
y13< 0.1%
 

Most occurring blocks

ValueCountFrequency (%) 
ASCII203584100.0%
 

Most frequent ASCII characters

ValueCountFrequency (%) 
N10175350.0%
 
o10175350.0%
 
S13< 0.1%
 
t13< 0.1%
 
e13< 0.1%
 
a13< 0.1%
 
d13< 0.1%
 
y13< 0.1%
 
Distinct2
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size795.2 KiB
No
101765 
Steady
 
1
ValueCountFrequency (%) 
No101765> 99.9%
 
Steady1< 0.1%
 
Frequencies of value counts

Unique

Unique1 ?
Unique (%)< 0.1%
Histogram of lengths of the category

Length

Max length6
Median length2
Mean length2.000039306
Min length2

Overview of Unicode Properties

Unique unicode characters8
Unique unicode categories2 ?
Unique unicode scripts1 ?
Unique unicode blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Most occurring characters

ValueCountFrequency (%) 
N10176550.0%
 
o10176550.0%
 
S1< 0.1%
 
t1< 0.1%
 
e1< 0.1%
 
a1< 0.1%
 
d1< 0.1%
 
y1< 0.1%
 

Most occurring categories

ValueCountFrequency (%) 
Lowercase Letter10177050.0%
 
Uppercase Letter10176650.0%
 

Most frequent Uppercase Letter characters

ValueCountFrequency (%) 
N101765> 99.9%
 
S1< 0.1%
 

Most frequent Lowercase Letter characters

ValueCountFrequency (%) 
o101765> 99.9%
 
t1< 0.1%
 
e1< 0.1%
 
a1< 0.1%
 
d1< 0.1%
 
y1< 0.1%
 

Most occurring scripts

ValueCountFrequency (%) 
Latin203536100.0%
 

Most frequent Latin characters

ValueCountFrequency (%) 
N10176550.0%
 
o10176550.0%
 
S1< 0.1%
 
t1< 0.1%
 
e1< 0.1%
 
a1< 0.1%
 
d1< 0.1%
 
y1< 0.1%
 

Most occurring blocks

ValueCountFrequency (%) 
ASCII203536100.0%
 

Most frequent ASCII characters

ValueCountFrequency (%) 
N10176550.0%
 
o10176550.0%
 
S1< 0.1%
 
t1< 0.1%
 
e1< 0.1%
 
a1< 0.1%
 
d1< 0.1%
 
y1< 0.1%
 
Distinct2
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size795.2 KiB
No
101764 
Steady
 
2
ValueCountFrequency (%) 
No101764> 99.9%
 
Steady2< 0.1%
 
Frequencies of value counts

Unique

Unique0 ?
Unique (%)0.0%
Histogram of lengths of the category

Length

Max length6
Median length2
Mean length2.000078612
Min length2

Overview of Unicode Properties

Unique unicode characters8
Unique unicode categories2 ?
Unique unicode scripts1 ?
Unique unicode blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Most occurring characters

ValueCountFrequency (%) 
N10176450.0%
 
o10176450.0%
 
S2< 0.1%
 
t2< 0.1%
 
e2< 0.1%
 
a2< 0.1%
 
d2< 0.1%
 
y2< 0.1%
 

Most occurring categories

ValueCountFrequency (%) 
Lowercase Letter10177450.0%
 
Uppercase Letter10176650.0%
 

Most frequent Uppercase Letter characters

ValueCountFrequency (%) 
N101764> 99.9%
 
S2< 0.1%
 

Most frequent Lowercase Letter characters

ValueCountFrequency (%) 
o101764> 99.9%
 
t2< 0.1%
 
e2< 0.1%
 
a2< 0.1%
 
d2< 0.1%
 
y2< 0.1%
 

Most occurring scripts

ValueCountFrequency (%) 
Latin203540100.0%
 

Most frequent Latin characters

ValueCountFrequency (%) 
N10176450.0%
 
o10176450.0%
 
S2< 0.1%
 
t2< 0.1%
 
e2< 0.1%
 
a2< 0.1%
 
d2< 0.1%
 
y2< 0.1%
 

Most occurring blocks

ValueCountFrequency (%) 
ASCII203540100.0%
 

Most frequent ASCII characters

ValueCountFrequency (%) 
N10176450.0%
 
o10176450.0%
 
S2< 0.1%
 
t2< 0.1%
 
e2< 0.1%
 
a2< 0.1%
 
d2< 0.1%
 
y2< 0.1%
 
Distinct2
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size795.2 KiB
No
101765 
Steady
 
1
ValueCountFrequency (%) 
No101765> 99.9%
 
Steady1< 0.1%
 
Frequencies of value counts

Unique

Unique1 ?
Unique (%)< 0.1%
Histogram of lengths of the category

Length

Max length6
Median length2
Mean length2.000039306
Min length2

Overview of Unicode Properties

Unique unicode characters8
Unique unicode categories2 ?
Unique unicode scripts1 ?
Unique unicode blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Most occurring characters

ValueCountFrequency (%) 
N10176550.0%
 
o10176550.0%
 
S1< 0.1%
 
t1< 0.1%
 
e1< 0.1%
 
a1< 0.1%
 
d1< 0.1%
 
y1< 0.1%
 

Most occurring categories

ValueCountFrequency (%) 
Lowercase Letter10177050.0%
 
Uppercase Letter10176650.0%
 

Most frequent Uppercase Letter characters

ValueCountFrequency (%) 
N101765> 99.9%
 
S1< 0.1%
 

Most frequent Lowercase Letter characters

ValueCountFrequency (%) 
o101765> 99.9%
 
t1< 0.1%
 
e1< 0.1%
 
a1< 0.1%
 
d1< 0.1%
 
y1< 0.1%
 

Most occurring scripts

ValueCountFrequency (%) 
Latin203536100.0%
 

Most frequent Latin characters

ValueCountFrequency (%) 
N10176550.0%
 
o10176550.0%
 
S1< 0.1%
 
t1< 0.1%
 
e1< 0.1%
 
a1< 0.1%
 
d1< 0.1%
 
y1< 0.1%
 

Most occurring blocks

ValueCountFrequency (%) 
ASCII203536100.0%
 

Most frequent ASCII characters

ValueCountFrequency (%) 
N10176550.0%
 
o10176550.0%
 
S1< 0.1%
 
t1< 0.1%
 
e1< 0.1%
 
a1< 0.1%
 
d1< 0.1%
 
y1< 0.1%
 

change
Categorical

Distinct2
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size795.2 KiB
No
54755 
Ch
47011 
ValueCountFrequency (%) 
No5475553.8%
 
Ch4701146.2%
 
Frequencies of value counts

Unique

Unique0 ?
Unique (%)0.0%
Histogram of lengths of the category

Length

Max length2
Median length2
Mean length2
Min length2

Overview of Unicode Properties

Unique unicode characters4
Unique unicode categories2 ?
Unique unicode scripts1 ?
Unique unicode blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Most occurring characters

ValueCountFrequency (%) 
N5475526.9%
 
o5475526.9%
 
C4701123.1%
 
h4701123.1%
 

Most occurring categories

ValueCountFrequency (%) 
Uppercase Letter10176650.0%
 
Lowercase Letter10176650.0%
 

Most frequent Uppercase Letter characters

ValueCountFrequency (%) 
N5475553.8%
 
C4701146.2%
 

Most frequent Lowercase Letter characters

ValueCountFrequency (%) 
o5475553.8%
 
h4701146.2%
 

Most occurring scripts

ValueCountFrequency (%) 
Latin203532100.0%
 

Most frequent Latin characters

ValueCountFrequency (%) 
N5475526.9%
 
o5475526.9%
 
C4701123.1%
 
h4701123.1%
 

Most occurring blocks

ValueCountFrequency (%) 
ASCII203532100.0%
 

Most frequent ASCII characters

ValueCountFrequency (%) 
N5475526.9%
 
o5475526.9%
 
C4701123.1%
 
h4701123.1%
 
Distinct2
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size795.2 KiB
Yes
78363 
No
23403 
ValueCountFrequency (%) 
Yes7836377.0%
 
No2340323.0%
 

readmitted
Categorical

Distinct3
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size795.2 KiB
NO
54864 
>30
35545 
<30
11357 
ValueCountFrequency (%) 
NO5486453.9%
 
>303554534.9%
 
<301135711.2%
 
Frequencies of value counts

Unique

Unique0 ?
Unique (%)0.0%
Histogram of lengths of the category

Length

Max length3
Median length2
Mean length2.460880844
Min length2

Overview of Unicode Properties

Unique unicode characters6
Unique unicode categories3 ?
Unique unicode scripts2 ?
Unique unicode blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Most occurring characters

ValueCountFrequency (%) 
N5486421.9%
 
O5486421.9%
 
34690218.7%
 
04690218.7%
 
>3554514.2%
 
<113574.5%
 

Most occurring categories

ValueCountFrequency (%) 
Uppercase Letter10972843.8%
 
Decimal Number9380437.5%
 
Math Symbol4690218.7%
 

Most frequent Uppercase Letter characters

ValueCountFrequency (%) 
N5486450.0%
 
O5486450.0%
 

Most frequent Math Symbol characters

ValueCountFrequency (%) 
>3554575.8%
 
<1135724.2%
 

Most frequent Decimal Number characters

ValueCountFrequency (%) 
34690250.0%
 
04690250.0%
 

Most occurring scripts

ValueCountFrequency (%) 
Common14070656.2%
 
Latin10972843.8%
 

Most frequent Latin characters

ValueCountFrequency (%) 
N5486450.0%
 
O5486450.0%
 

Most frequent Common characters

ValueCountFrequency (%) 
34690233.3%
 
04690233.3%
 
>3554525.3%
 
<113578.1%
 

Most occurring blocks

ValueCountFrequency (%) 
ASCII250434100.0%
 

Most frequent ASCII characters

ValueCountFrequency (%) 
N5486421.9%
 
O5486421.9%
 
34690218.7%
 
04690218.7%
 
>3554514.2%
 
<113574.5%
 

Interactions

Correlations

Pearson's r

The Pearson's correlation coefficient (r) is a measure of linear correlation between two variables. It's value lies between -1 and +1, -1 indicating total negative linear correlation, 0 indicating no linear correlation and 1 indicating total positive linear correlation. Furthermore, r is invariant under separate changes in location and scale of the two variables, implying that for a linear function the angle to the x-axis does not affect r.

To calculate r for two variables X and Y, one divides the covariance of X and Y by the product of their standard deviations.

Spearman's ρ

The Spearman's rank correlation coefficient (ρ) is a measure of monotonic correlation between two variables, and is therefore better in catching nonlinear monotonic correlations than Pearson's r. It's value lies between -1 and +1, -1 indicating total negative monotonic correlation, 0 indicating no monotonic correlation and 1 indicating total positive monotonic correlation.

To calculate ρ for two variables X and Y, one divides the covariance of the rank variables of X and Y by the product of their standard deviations.

Kendall's τ

Similarly to Spearman's rank correlation coefficient, the Kendall rank correlation coefficient (τ) measures ordinal association between two variables. It's value lies between -1 and +1, -1 indicating total negative correlation, 0 indicating no correlation and 1 indicating total positive correlation.

To calculate τ for two variables X and Y, one determines the number of concordant and discordant pairs of observations. τ is given by the number of concordant pairs minus the discordant pairs divided by the total number of pairs.

Phik (φk)

Phik (φk) is a new and practical correlation coefficient that works consistently between categorical, ordinal and interval variables, captures non-linear dependency and reverts to the Pearson correlation coefficient in case of a bivariate normal input distribution. There is extensive documentation available here.

Missing values

Sample

First rows

encounter_idpatient_nbrracegenderageweightadmission_type_iddischarge_disposition_idadmission_source_idtime_in_hospitalpayer_codemedical_specialtynum_lab_proceduresnum_proceduresnum_medicationsnumber_outpatientnumber_emergencynumber_inpatientdiag_1diag_2diag_3number_diagnosesmax_glu_serumA1Cresultmetforminrepaglinidenateglinidechlorpropamideglimepirideacetohexamideglipizideglyburidetolbutamidepioglitazonerosiglitazoneacarbosemiglitoltroglitazonetolazamideexamidecitogliptoninsulinglyburide-metforminglipizide-metforminglimepiride-pioglitazonemetformin-rosiglitazonemetformin-pioglitazonechangediabetesMedreadmitted
022783928222157CaucasianFemale[0-10)NaN62511NaNPediatrics-Endocrinology4101000250.83NaNNaN1NaNNaNNoNoNoNoNoNoNoNoNoNoNoNoNoNoNoNoNoNoNoNoNoNoNoNoNoNO
114919055629189CaucasianFemale[10-20)NaN1173NaNNaN59018000276250.012559NaNNaNNoNoNoNoNoNoNoNoNoNoNoNoNoNoNoNoNoUpNoNoNoNoNoChYes>30
26441086047875AfricanAmericanFemale[20-30)NaN1172NaNNaN11513201648250V276NaNNaNNoNoNoNoNoNoSteadyNoNoNoNoNoNoNoNoNoNoNoNoNoNoNoNoNoYesNO
350036482442376CaucasianMale[30-40)NaN1172NaNNaN441160008250.434037NaNNaNNoNoNoNoNoNoNoNoNoNoNoNoNoNoNoNoNoUpNoNoNoNoNoChYesNO
41668042519267CaucasianMale[40-50)NaN1171NaNNaN51080001971572505NaNNaNNoNoNoNoNoNoSteadyNoNoNoNoNoNoNoNoNoNoSteadyNoNoNoNoNoChYesNO
53575482637451CaucasianMale[50-60)NaN2123NaNNaN316160004144112509NaNNaNNoNoNoNoNoNoNoNoNoNoNoNoNoNoNoNoNoSteadyNoNoNoNoNoNoYes>30
65584284259809CaucasianMale[60-70)NaN3124NaNNaN70121000414411V457NaNNaNSteadyNoNoNoSteadyNoNoNoNoNoNoNoNoNoNoNoNoSteadyNoNoNoNoNoChYesNO
763768114882984CaucasianMale[70-80)NaN1175NaNNaN730120004284922508NaNNaNNoNoNoNoNoNoNoSteadyNoNoNoNoNoNoNoNoNoNoNoNoNoNoNoNoYes>30
81252248330783CaucasianFemale[80-90)NaN21413NaNNaN68228000398427388NaNNaNNoNoNoNoNoNoSteadyNoNoNoNoNoNoNoNoNoNoSteadyNoNoNoNoNoChYesNO
91573863555939CaucasianFemale[90-100)NaN33412NaNInternalMedicine333180004341984868NaNNaNNoNoNoNoNoNoNoNoNoNoSteadyNoNoNoNoNoNoSteadyNoNoNoNoNoChYesNO

Last rows

encounter_idpatient_nbrracegenderageweightadmission_type_iddischarge_disposition_idadmission_source_idtime_in_hospitalpayer_codemedical_specialtynum_lab_proceduresnum_proceduresnum_medicationsnumber_outpatientnumber_emergencynumber_inpatientdiag_1diag_2diag_3number_diagnosesmax_glu_serumA1Cresultmetforminrepaglinidenateglinidechlorpropamideglimepirideacetohexamideglipizideglyburidetolbutamidepioglitazonerosiglitazoneacarbosemiglitoltroglitazonetolazamideexamidecitogliptoninsulinglyburide-metforminglipizide-metforminglimepiride-pioglitazonemetformin-rosiglitazonemetformin-pioglitazonechangediabetesMedreadmitted
101756443842070140199494OtherFemale[60-70)NaN1172MDNaN466171119965854039NaNNaNNoNoNoNoNoNoNoNoNoNoNoNoNoNoNoNoNoSteadyNoNoNoNoNoNoYes>30
101757443842136181593374CaucasianFemale[70-80)NaN1175NaNNaN211160014915185119NaNNaNNoNoNoNoNoNoNoNoNoNoNoNoNoNoNoNoNoSteadyNoNoNoNoNoNoYesNO
101758443842340120975314CaucasianFemale[80-90)NaN1175MCNaN7612201029283049NaNNaNNoNoNoNoNoNoNoNoNoNoNoNoNoNoNoNoNoUpNoNoNoNoNoChYesNO
10175944384277886472243CaucasianMale[80-90)NaN1171MCNaN10153004357842507NaNNaNNoNoNoNoNoNoNoNoNoNoNoNoNoNoNoNoNoUpNoNoNoNoNoChYesNO
10176044384717650375628AfricanAmericanFemale[60-70)NaN1176DMNaN451253123454384129NaNNaNNoNoNoNoNoNoNoNoNoNoSteadyNoNoNoNoNoNoDownNoNoNoNoNoChYes>30
101761443847548100162476AfricanAmericanMale[70-80)NaN1373MCNaN51016000250.132914589NaN>8SteadyNoNoNoNoNoNoNoNoNoNoNoNoNoNoNoNoDownNoNoNoNoNoChYes>30
10176244384778274694222AfricanAmericanFemale[80-90)NaN1455MCNaN333180015602767879NaNNaNNoNoNoNoNoNoNoNoNoNoNoNoNoNoNoNoNoSteadyNoNoNoNoNoNoYesNO
10176344385414841088789CaucasianMale[70-80)NaN1171MCNaN53091003859029613NaNNaNSteadyNoNoNoNoNoNoNoNoNoNoNoNoNoNoNoNoDownNoNoNoNoNoChYesNO
10176444385716631693671CaucasianFemale[80-90)NaN23710MCSurgery-General452210019962859989NaNNaNNoNoNoNoNoNoSteadyNoNoSteadyNoNoNoNoNoNoNoUpNoNoNoNoNoChYesNO
101765443867222175429310CaucasianMale[70-80)NaN1176NaNNaN13330005305307879NaNNaNNoNoNoNoNoNoNoNoNoNoNoNoNoNoNoNoNoNoNoNoNoNoNoNoNoNO